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FOREWORD 


The increasing complexity of civilization, with the increasing complexities 
of its societies, economics, and industries, has made urgent the need for 
equal advances in our resources for control, understanding, and coordina- 
tion. In the last twenty years great advances have been made, but they have 
sometimes been technically difficult and not widely understood. With the 
arrival of such technical methods there is always the danger that mere 
technical virtuosity may lose contact with good sense and understanding. 

Van Court Hare’s Systems Analysis admirably achieves its aim of intro- 
ducing the reader to these methods, while basing the teaching on explana- 
tion and understanding rather than on any specialized technique. Formal 
mathematics are almost entirely avoided ( although the reader is invited to 
think in ways that he will later discover to be essentially mathematical). 
The abundance of carefully graded exercises, which start at the most basic 
level, make the book suitable for self-tuition; the projects, for more pro- 
longed work, will be useful in class. 

Though starting from first principles, Professor Hare uses a treatment 
that is essentially modem, for it is based on the general thesis that the 
dominating factor in the complex system is the large quantity of information 
involved. The clear treatment of this aspect in the early chapters leads 
naturally to the full treatment of simplification methods in the later. I 
therefore welcome Van Court Hare’s Systems Analysis as being clear, sound, 
and modem. 


University of Illinois 


W. Ross Ashby 



EDITOR’S FOREWORD 


In The Grapes of Wrath, John Steinbeck wrote, “Man, unlike any other 
thing organic or inorganic in the universe, grows beyond his work, walks 
up the stairs of his concepts, emerges ahead of his accomplishments.*’ The 
total systems concept is such a stairway. Where it will lead, how it will 
change the world as we know it— we shall find this out in less time than many 
think. 

Systems Analysis is one of the finest examples of the “new systems litera- 
ture” for management. This literature tries to cope with the dynamics of 
structure. It emphasizes point and counterpoint, the balance of asymmetrical 
forces, flow and change. 

The universal flux of systems is information. Information can take many 
forms: money, materials, satisfaction, and time. And all these forms can be 
related to one another according to the purpose of a particular system. 

A text in this field must, therefore, employ the opcrationalist’s philoso- 
phy, satisfy the empiricist's standards, and meet the rationalist's require- 
ments. It must foster understanding about the nature of insight. It must 
stress both diagnosis and invention. It must deal abstractly with the form 
and meaning of details. This book achieves these goals. 

In systems analysis we are searching for significant elements and regular 
patterns that have diagnostic validity. Only recently have we recognized that 
to manage complex systems we must understand how knowledge is obtained 
and how knowledge can be transferred. That, in essence, is what the systems 
concept requires. Not just some systematic, analytical procedure; not some 
cookbook approach, no matter how fine the recipes. This text presents a 
framework - for the “new systems approach " which includes the context and 
environment that underlie our understanding of complex organizations. 

What are the attributes of such a framework? It must catalyze valid in- 
sights and support consistent conclusions. It must relate the choice of the 
level of complexity of a system to the problem and the purpose of the sys- 
tem. It must suggest procedures and generate approaches that can cope (not 
overcope or undercope) with the chosen level of complexity. It must high- 
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light connections and interactions so that diagnosis at one point in the sys- 
tem can be quickly related to all relevant connectives at other points in the 
system. It must provide measurable belief and confidence in the approach 
that is being used. It must enable interested parties— especially those who 
supply the data and those who use the results-to describe the system and to 
communicate mi}? one another. It must make it possible to translate the 
different languages of subsystems into a unified expression of the total sys- 
tem. 

These are the attributes that Systems Analysis displays. 

Martin K. Starr 

Columbia University 



PREFACE 


Systems analysis is the selection of elements, relationships, and procedures 
to achieve a specific purpose; some examples include the use of road maps to 
reach a specific city, office procedures to communicate information, and 
equipment combinations to handle a given set of jobs. This text surveys the 
field of systems analysis for the business student. The materials used are also 
applicable to most industrial, military, and administrative systems, and thus 
will be of interest to administrative, engineering, operations research, man- 
agement science, and computer personnel. 

This book attempts to show the reader that systems have several general 
properties and to give him a feeling for systems patterns. It develops skills 
in formalizing systems problems and provides a bridge between functional 
training and the systems requirements of problem formulation, analysis, and 
solution encountered in practice. 

In the past, systems analysis has been treated from specialized view- 
points— such as paperwork, accounting, computer programing, aircraft de- 
sign, and communication circuits— or from highly generalized viewpoints— 
such as philosophy systems, value systems, and pure mathematics. This book 
strikes a middle ground by stressing pattern formulation and diagnosis of 
system operations in a wide range of fields. As such, it will appeal more to 
“problem solvers” than to pure generalists or “how-to-do-it” readers. 

The organization of this book makes it unique in the field of systems 
analysis. Although both specific and general materials are used, the main 
puipose of this textbook is to build basic skills in system definition, analysis, 
and solution of systems problems. Thus, we have the major divisions of Sys- 
tem Definition, Systems Analysis and Diagnosis, and Systems Treatment. 

Class testing of the many examples, illustrations, and problems at Co- 
lumbia University over a period of three years has shown them to stimulate 
student imagination and curiosity in handling independent work, such as 
term projects. 

In Chapters 1 through 5 special attention has been given to system 
definition so that the reader may be adequately prepared for this most crit- 
ical step in systems analysis. The necessity for adjusting the system defini- 
te 
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tion to the analyst’s resources for analysis anfl coni *>h '“S'*” ' *V P 
nropriate simplification procedures, follows in Chapters 6 through 8. Ch. p- 
ters B through 12 contain selected procctlurtrs for systems analysis, lnc ’ u ““S 
external and internal systems tests, search methods, logical and probabilistic 
diagnostic procedures, and systems simulation, and thus provide a range of 
tools for the reader. Problems of systems improvement and the implements- 
ton of system change conclude the treatment in Chapters 13 and 14. 

Topics from the social sciences arc introduced, for example, in Chapters 
8 and 13, to complement quantitative system definitions and forms of anal- 
ysts. Hiis approach is a departure from that taken in traditional systems 
analysis textbooks and provides the reader with strategies for higher-level 
system definition and the diagnosis of implementation problems. Chapter 
10, “Selected Search Problems and Techniques, contains topics not 
previously brought together in one text. 

The reader will derive maximum benefit if he exploits the full range of 
examples, problems, and projects provided. However, should he become in- 
terested in a special topic, he can easily devote his energies to that subject 
through the references provided by the numerous footnotes and extensive 
bibliography. 

The mathematical background assumed includes a first course in proba- 
bility and statistics as well as a familiarity with undergraduate algebra. No 
knowledge of calculus is assumed. Appendices covering basic matrix calcu- 
lations, an introduction to mathematical transformations, and an introduc- 
tion to commercial codes, information measures, and Geld data collection 
are provided for reference. It would be helpful, although not necessary, for 
the reader to have had an introduction to data processing and operations 
research. The usual business school reader will also be familiar with one or 
more functional specialties (accounting, production, marketing, finance, 
etc. ), and this knowledge will also be of use to him here, 

I wish to acknowledge the moral and financial support provided by Dean 
Courtney C, Brown of the Graduate School of Business, Columbia Univer- 
sity » ^ die completion of this work. I am indebted to my colleagues and 
students for innumerable suggestions. Acknowledgments for the use of 
previously published material are made where citations appear. Susan 
Alewon typed much of the manuscript with dispatch. I am particularly in- 
debted to my parents for their initial stimulation and continued support of 
my interest in science and its wider applications, and to my wife, Auwe, for 
her care and patience over an extended writing schedule. Errors may remain 
m even the most diligently checked manuscript, and for these I must assume 
the final responsibility Comments and suggestions for improvement of this 
text wul be appreciated. 


Irdfigfon, New York 


Van Court Hare, Jr. 
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INTRODUCTION 


The study of “systems” is by no means a new pursuit of the human mind. 
The Egyptian architects of Cheops’ tomb relied on a system of measurement 
for its construction. Phoenician astronomers studied a system of the stars, 
and made rough predictions from their observations. Plato and other Greeks 
contemplated a system of society in which philosophers would be kings. 

The development of a set of standards and procedures, or the develop- 
ment of a concept of society, or even a theory of the universe is as old as 
history itself. Man has always sought to find relationships: Satisfactory 
explanations for what he sees, hears, or dreams of. 

Indeed, the history of both the physical and the social sciences has been 
a continuing enlargement upon this theme: Relevant and dependable rela- 
tionships must be found if we are to advance knowledge and successfully 
conduct our affairs. The scientific method of inquiry, which demands such 
relevant and dependable relationships for its results, is systems analysis in 
its broadest sense. 


THE NEW SYSTEMS ANALYSTS 

Today, however, systems analysis and systems analysts are institutionalized 
in many specialized areas of activity, most frequently related to the diag- 



2 introduction 


nosis design, evaluation, end treatment oi complete hardware, information 
Slow, and r&antoUonal systems that east, or arc conceived, to accomplish 
one or mote specific ob[ectivcs. The approach, methods, and tools used, as 
well as the results obtained, are of a kind unfamiliar in the past. 

In such purposeful systems, from radar control devices to the operation 
of a business or a government agency, the investigator finds he must cross 
historical boundaries of knowledge and functions, if he is to achieve major 
improvements in the system’s operation. 

Hardware systems consisting of electrical, mechanical, hydraulic, opti- 
cal, and chemical components are used by human operators, who have their 
own biological, medical, and social properties. To understand, to design, 
or to improve the operation of such a “total system," investigators must 
have available not only knowledge of, say, electrical engineering, but abo 
knowledge of the other disciplines. 

Moreover, to understand and to improve such systems by utilizing the 
relationships that result when diverse components are combined, investi- 
gators must be able to follow and exploit the chain of work Dow, informa- 
tion flow, or material flow— the connecting links that tie the system together. 
And, these connecting links almost always lead across the boundaries of 
educational disapline and organizational function. 

For example, we now have electronic radar detectors based on the 
analysis of a frog's eye. Electronic systems telemeter and record, on the 
ground, an astronaut's heart beat as he orbits 300 miles above the earth. 
Mechanical milling machines automatically produce parts from numerical 
directions given by a human, or a computer, and can store these skills for 
future use. Maser and laser devices, combinations of optics and electronics, 
set time standards to one part in ten billion, make optical stitching of de- 
tached human retinas possible, provide highly directive, wide-band com- 
munication possibilities, have been used in industrial welding and cutting 
equipment, and form the power source for “death ray” rifles, reportedly 
capable of stopping the enemy silently, without firing a shot. These mixed- 
discipline systems depend upon the conscious exploitation of the gaps be- 
tween historical categories of knowledge. 

Similarly, the operations oi a business enterprise require the combina- 
tion of numerous functions-purchasing. production, distribution, research 
and Product design, sales, finance, legal and tax departments, and the like. 
But, the flow of information and material, required to fulfill the purpose of 
the business, by necessity cuts across these historical functions and depart- 
mental boundaries, 

.V*®* w CTed! ' timi ”S a<> •*«* Pactum schedul- 
of IT y if”""? 8 °" , 5' m Wr - nuoU orders. He formulation 

andreJ t pat,erns of <•«*'>»*»>. wholesale 

and retad operates, a „d product profit margins by changing the product 
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mix sold and by changing the sales methods used. The fact that a single 
machine tool can produce short runs of product at virtually no setup cost- 
and that minor modifications of product design can be made by patching 
together sections of paper tape, or feeding a modified formula to a com* 
pu ter— profoundly affects inventory levels, the labor complement to 
be hired, and the type of products we can offer for sale. Automatic as- 
sembly, packing, and order-picking equipment affects the physical layout 
of the factory, warehouse, and office, as well as the capital requirements of 
the firm. These developments also present mixed-discipline, systems analysis 
problems. 

Thus, improvement in the business system for the enterprise os a whole 
requires the investigator to disregard formal boundaries, or to cross them 
at will. 

The same argument holds in the development of the professional fields. 
The patent lawyer must know both engineering and law. The physicist 
Ends a thin line exists between his specialty and chemistry. The designer 
of medical instruments turns into a combination physician and electronics 
expert. The modem corporate controller must have a general knowledge 
of all the basic sciences that affect his company’s business, an ability to 
handle complex mathematics, and a talent for creating new systems. 

In short, in the last ten years, it has become necessary to formalize a 
type of training and educational approach to systems that can generate 
large numbers of individuals and work groups whose purpose is specifically 
to cross forbidden boundaries and to relate diverse operations and special- 
ties toward a definite systems purpose. 

These are the new systems analysts. 


SYSTEMS ANALYSIS AS AN INSTITUTION 

Educators have always attempted to give their students the ability to relate 
and transfer knowledge from one problem to another, so that the student 
could multiply his successes and eliminate his mistakes. However, this has 
usually been done within a functional, or departmental framework at the 
university. Today, to satisfy the clear need for boundary-crossing people, 
most universities offer interdisciplinary degrees and hybrid courses (such as 
bioelectronics, electronic auditing, and mathematical planning), as well 
as courses pointed specifically at systems analysis in business systems and 
hardware systems. 

Such courses in systems analysis are geared both to broaden the student's 
interestin related fields (such as electrical, mechanical, optical, and chemical 
engineering; or production planning, accounting, and electronic data process- 
ing) and to provide him with a number of tools, methods, and strategic 
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approaches and with a philosophy for the analysis of a system os a total 

0P '?X systems analysts are institmionaUaed in academia . Some 120-150 
English-language journals (and another 3W» taeignJangnoge )onmak) 
Itmlatly cany articles at direct or related interest to systems analysts. A 
survey of the engineering, science, and business school catalogues of major 
United States institutions of higher learning shows that 20-301 of the 
university courses offered have a similar content-even if the classic courses 
m mathematics and statistics are excluded! 

This educational trend, uhich hot not yet reached full momentum, is 
backed by the employment of systems analysts in government, business, 


and industry. 

In most organizations of any size, one may find a systems anmyris g roup, 
operation » research group, or long-range-planning group that is working 
on system diagnosis, design, and improvement. Some organizations, such 
as the Bell Telephone System, have made organized efforts of this type 
for 20 years or more, and most government and military' agencies have 
been heavily engaged in systems analysis since World War II. Industry— 
from mining and refining, to brewing, clothing, appliances and amusement— 
has its “in house” groups, plus numerous outside consulting agencies. 

Trom 1966 membership lists of professional societies, such as The 
Operations Research Society of America, The Institute of Electrical and 
Electronic » Engineers, The Association, for Computing Machinery, The 
Systems and Procedures Association, The Ceneral Systems Society, The 
Institute of Management Science and similar organizations, plus various 
special groups in industry and government, we see that professionals en- 
gaged in systems analysis and related activities number no less than 300,000. 

Moreover, technical developments in electronic data processing and 
the development of new theories of systems analysis have made available 
a substantial body of experience that treats systems problems in general. 
So, the efforts of systems analysts are also institutionalized in theory and 
equipment 


For example, Norbert Wieners (1&4S) landmark work: Cybernetics, or 
Communication and Control in the Animal and the Machine, and Claude 
Shannons work on Information Theory set the stage for a consolidation of 
prstem properties in a wide range of disciplines by viewing “systems prob- 
lems as problems of information transfer and control-the essential processes 
in the l auctioning of any organization. 

Other investigators have isolated -patterns* of operation (so* as tie 
toy A queues'!, methods of analysis (inch as dynamic programing), and 

— - 

The advent o! electronic computation (10 million times faster than the 
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mechanical calculator at this writing), the techniques of using stored logic 
program steps in the computer memory to automate problem solution and 
data processing, the computers ability to control other equipment (and 
other computers and communication devices ) on a microsecond, or “real- 
time" basis; and the programers ability to make his machine duplicate, 
or simulate other operations (by means of appropriate logical design) have 
transformed the old methods of “pattern-seeking" and testing into revolu- 
tionized potentials for systems analysis. 1 

Using these theories, or models, we can predict and test system properties 
by computer simulation or mathematical computation before we construct 
actual systems. A newly proposed inventory control or accounting system, 
for example, can be specified, designed, tested, and evaluated before it is 
placed into operation— the effects of the new plan can be checked for possible 
side-effects in other operations of the system, say, production scheduling. 

In addition, installing computers to handle an organization's data process- 
ing chores has furthered the view of an operation as a “total system,” 
because this approach is necessary to gain the maximum efficiency and 
benefit from the expensive hardware. 

This trend toward the “total system approach” in business was noted in 
a Business Week article ( February 29, 1964, p. 85) as follows: 

"Progress has been so great and so fast (in the installation of computers] 
that only those totally involved with computers are capable of keeping track 
of it. Yet future progress will be even greater. 

“The most important evidence of that future progress will almost certainly 
be in the development of the ‘total system.’ 

“Most companies seriously concerned with putting computers to work al- 
ready have a ‘total system’ plan in their files. Generally, its primary goal is to 
shorten die interval between receipt of an order and delivery of a product 

“But a total system, also, is expected to pay large dividends by reducing 
inventory, eliminating rush orders and duplication of effort, improving cus- 
tomer service, improving purchasing through better vendor records, ending 
double-record keeping, and scheduling production-up to and including the 
actual running of the plant. 

‘‘Often, too, a total system involves plans to standardize nomenclature, 
reduce employment fluctuations, keep personnel records, provide engineering 
information services, and give management a simple, but frequent, statement 
or chart on what actually is going on in the company, compared to schedules 
and forecasts." 

The trend toward the “total systems viewpoint,” as it is called in the 
popular literature, has had major effects on the design and the integration 
of operations in many fields, so that the effects of systems analysts are also 
institutionalized in present applications. 

» j n ear ]y J966, a conservative estimate of the computer installations in operation was a 
number reaching 35,200 with an annual growth rate of 10-25S. 
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To take one representative illostration. a DC8-F ear, depart from New 
York as late as 1.00 A.M. and attire on the West Coast before daw P 
viding delivery of merchandise at the beginning of the business day. The 
abilitj- to achieve this feat requires an integrated flow of material from 
dock to air to dock, the use of the principles of automation, and the highly 
organ, zed sorting and handling of packages. One of the foremost examples 
of such a system can be found in United Airline's present installation at 
O’Hare Airport in Chicago- 


“Freight moves automaticaUy on tracks in a system of smooth-flowing dis- 
tribution, sorting and loading Operators in a control room push buttons to 
move freight on a maze of tracks that looks like a giant electric tram set 
The cargo is loaded on large pallets to fit the DC-8F*s 7,508 cubic feet of cabin 
cargo space Motorized transporters quickly carry each pallet from the freight 
terminal to the plane, and special loaders lift it aboard. No manual lifting is 
involved The equipment permits loading 92,000 pounds of palletized freight 
aboard a single jet freighter in 22 minutes, a speed the industry has never 
seen before. (It takes two to four hours to load a piston cargo plane under the 
ordinary method.)" * 


Similarly, some production operations are today entirely controlled by a 
computer control system, devised and implemented by systems analysts. 

At Seadrift, Texas, Union Carbide Olefin's Company’s ethylene plant is 
a production operation in which a computer controls valves and measures 
performance from the first to the last step. The major accomplishment due 
to the speed of response of the integrated control system is less product 
waste, which previously cost about 52 of plant capacity. 


"Faster reaction and ability of a computer to take into consideration several 
interactions at once male it possible to design a plant that will operate at 
better than 991 of theoretical capacity. The 22 to 52 increase in efficiency 
that this represents could mean an increase of as much as 202 in the profit 
potential of the plant, since the extra production would come virtually free . . . 

In addition to the several hundred devices and gauges that sense, measure, 
and correct temperature, pressures, and flow rates, which are all tied into the 
computer, there are on-stream chemical analyzers that deliver a total of 289 
different chemical anal>ses of the product stream from different parts of 
the plant * 


These analyses come in automatically at different intervals-from once 
rn .r tC V° ° nCC W ll ' s for the computer 

o L bU, u "™ ld dr,v ' l ™ to distraction, probably 

to the point they would ignore most of the data ... 

thu is e n^t? Uter Wl13 a ! W3> ? re#d d* 5 ^formation and act immediately. And 
that is one big reason why the plant operates at high efficiency" * 

; LW Uazo-Jnt' Vo! 8. No 3, March 1964 n 9 

'« &r>>Sv”cJ © n" W " i - November 16S3. PP- 
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Wcstinghouse’s Tele-Computing Center in Braddock Hills, just east of 
Pittsburgh is on example of the system analysts approach to improving 
paperwork flow and management control. Two Univac 409 Computers 
tie into a national teletype network that connects 300 sales offices, ware- 
house s, and plants in 178 United States and Canadian cities. The system 
handles 17,000 messages daily, of which about 2000 are orders. Order- 
processing time is 2-3 seconds. The computer and teletype system check 
availability, calculate prices, discounts, and maintain inventory. The 
system prints by remote teletype at the proper warehouse all bills of lading 
and shipping labels. At the end of each day the system provides a record 
of what has been done, produces operating reports on 100 profit centers, 
and prints Wcstinghousc salary', pension, and dividend checks when they 
are needed. Here the improvement in efficiency has also been striking : 
one motor plant alone cut warehouse inventory from $5-million to $1.8* 
million, yet carries a wider range of items than before. 1 

Other current applications can be cited: Orbiting missiles, controlled 
by a complex of man-computer systems; analyses and designs for traffic 
flow; urban redevelopment plans; military logistics developments; auto* 
mated diagnosis of medical symptoms; computer translation of scientific 
documents; the ever-increasing choice of instrumentation and control de- 
vices. But, the previous illustrations are sufficient to make our point. 

The new systems analysts are not only here, they are institutionalized 
in the university, in government, in business, in industry, in the development 
of new theory and equipment, and in their efforts to change the scope and 
complexion of current, on-going operations. They are backed by millions 
of hours of educational effort, by billions of dollars in research, development 
and production funds, by myriad instrumentation, computation, and auto- 
mation devices, and by the full-time thinking of some of the finest minds 
in the world. The scene is different: There has been a revolution in the 
concept, and in the use of systems since the days of Cheops' Tomb. 


SYSTEMS ANALYSIS AND MANAGEMENT THEORY 

The social and technical changes surveyed in the past few pages have had 
a profound effect upon management. The problems faced by the executive 
and the administrator have changed in kind as well as in content. Today, 
the question raised for the manager or supervisor is not so much, “Can 
the job be done?” The technical means to achieve a wide range of objec- 
tives with various degrees of effectiveness (and efficiency) are at hand. 
Bather, the reverse situation often confronts the manager: There are too 
many alternatives to be pursued, too many combinations from which to 
* “Sperry Band; Clearing Skies?" Forbes Magazine, April 1, 1904, pp. 20-24. 



select, too many distractions to confound and confuse the decision-maker, 
too many things that can go wrong with the complex operations that must 
be administered. Today, the questions turn more to “Should the job be 
done?*, “What choice shall be selected?”, and “How can our programs be 
balanced?” 

For example, in considering the management problems to be bandied 
in the Department of Defense, Secretary McNamara has asked just such 
questions. 5 The problems of relevancy to stated objectives, the development 
and choice of clear alternatives, the balance and integration of plans and 
operating subsystems-ail are part of what might be called a “systems 
philosophy of management." 


“Our problems of choice among alternates in strategy and in weapons 
systems base been complicated enormously by the bewildering array of en- 
tirely workable alternative courses which our technology can Support. We 
believe the nanern can afford whatever investment in national security is 
TCCTsary The Moult quotum tVlut tt twitutcdr It u far more difficult 
to tad a deftnto proaram on thts land u( Inundates than it is to set » 
budge «to s and fan squeetc sssto tt whatever ptogtatns >ou can. However 
auncuit, this is exactly what we set out to do 

.1 In <as ' tumpetusg progrtttm and systems ate indeed on the basis 
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The Secretary then continues, explaining the 
methods to aid the judgment of the decision-maker: 
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“I would not, if I could, attempt to substitute analytical techniques for 
judgment based upon experience. The very development and use of those 
techniques have placed an even greater premium on that experience and 
judgment, as issues base been clarified and basic problems exposed to dis- 
passionate examination. The better the factual basis for reflective judgment, 
the better the judgment is likely to be. The need to provide that factual basis' 
is the reason for emphasizing the anal) tical technique.” ' 

Although the Department of Defense is an extreme case in the com- 
plexity of modem management, its problems mirror in exaggerated form 
those of the manager, operator, or designer of the smaller complex system. 

For this reason, the modem manager needs to have available to him a 
theory, structured framework, or organized philosophy of how he will go 
about his job; specifically, how he will formulate his questions and prob- 
lems, and how he will go about answering and solving them in a manner 
that can handle the growing variety of conditions, actions, and choices 
he must counter. 

The key here is variety. And, the purpose of systems analysis, and sys- 
tems analysts, from a management viewpoint, is the management and control 
of variety before variety controls and manages the manager. 

From the management viewpoint, then, systems analysis takes on what 
one may consider a more philosophical tone. It treats problems of the 
complex system at a level of abstraction once removed from immediate 
detail, although requiring selected detail for its execution. It deals with the 
formulation and evaluation of precise alternatives, with the selection of 
activities and their level, with the integration of short- and long-range 
plans, with the specification of values and objectives, and with the im- 
plementation of strategies of diagnosis, trouble-shooting, repair and control. 

These are the overriding decisions that commit organizations to courses 
of action, that control and hold them to a goal until a better one can be 
found. These are questions that bring into play considerations of goal 
changing, value changing, and innovations in policy as changes in tech- 
nology and the environment take place. 

We shall later refer to this “higher level of ccmUoT more extensively. 
However, we should note that the development of ability and power to 
control in the face of a variety of possible conditions requires the develop- 
ment of extensive memory facilities in the organization or system, in 
particular the development of efficient memory functions for the selection, 
storage, retrieval, recombination, and use of operating data, symbols, ana- 
lytic procedures, and reserve plans of action. 

By analogy, the development of systems control ability from a manage- 
ment viewpoint is similar to the development of a supervisory control 
program for a computer-as opposed to the development of a single efficient 

' Ihtd. 
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production program. The supervisory control program, like the human 
supervisor, is concerned with priorities and values, the allocation of space 
and tune, with the selected use of different production programs, sources 
of data, plans, and procedures in response to changes in input conditions. 
The success of the supervisory program is judged by its effectiveness in long- 
range strategic decision making, or selection, rather than in terms of speed, 
day-to-day operating efficiency, or short-range tactics, as might be the 
case in the production program. 

Thus, in making a study of systems analysis, we must cover, for the 
student of business, not only a number of specific systems properties, tech- 
niques, and methods, but also cover what eventually will become his 
major concern, the broader considerations of control at higher levels. We 
must cover a scope of discourse that runs the gamut from simple governors, 
quality control systems and simple multidisripline devices to the problems 
faced by the “total system designer." 



SYSTEMS DEFINITION 
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DEFINING THE SYSTEM: 
A SURVEY 


We define a system when we draw a map. For example, Figure 1-1 is a 
small road map that shows the interstate highways connecting cities A and B 
with those cities Outside our map’s border. The system elements are the 
cities (circles), and the relationships illustrated are the interstate highways 
(arrows). 

Both the elements and the relationships shown in Figure 1-1, however, 
have been carefully selected to achieve a specific purpose — illustration of 
the interstate highway system in the area— and the choice of elements and 
relationships in our simple example is relevant to that end. 



1-1 A simplified road map showing the interstate highway system between two cities 
and other cities outside the border. 
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THE OBJECTIVE OF A SYSTEM DEFINITION 

In defining a system, the matter of relevance is all important. What hap- 
pens withm the boundary of our map could be described in many 
different ways—by a drawing, by equations, by a physical model, or by a 
verba! description. However, unless the elements and relationships to be 
considered within the map’s borders are selected for a specific purpose, 
an infinite number of connections and combinations would be possible. 

Let us elaborate for emphasis. The observer of this landscape could 
paint it, draw a topological map, write equations of the traffic Sow passing 
through it, classify the flora and fauna there, develop a history of the area, 
examine and relate the chemical content of its soil and streams, or male a 
study of how religious belief affects local law, to mention a few of the proj- 
ects that could illustrate some kind of relationship in the scene. But the 
investigator would hardly care to undertake all of these projects at one 
time. Indeed, it would be physically impossible for him to do so. The num- 
ber of combinations of relationships and elements that could be chosen 
rapidly grosses to greater than astronomical proportions even i chert substan - 
ttal restrictions are applied 

Consider the choice o! elements to be related. In our map in Figure 1-1, 
suppose we had 400 political subdivisions instead of two cities and that « 
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1*2 Three of the possible 10'* patterns that can be drawn by placing spots In a 
20 X 20 grid. 


1-3 A map of all the possible connections 
between the two cities shown In Figure 1*1. 

An additional important property of system description also leads* to 
complexities for which we presently have no solution. It is often desirable 
to know if two systems have exactly the same structure, that is, if they are 
isomorphic. This is an important property, because if two systems are iso- 
morphic, what we have learned about the properties of one such system will 
often allow us to make reasonable statements about the properties of the 
other. Yet, proving that two systems are isomorphic can involve us in com- 
binatorial problems such as those mentioned above. 

For example, each graph illustrated in Figure 1-4 has five elements and 




1-4 Two five-node, eight-arc graphs that are Isomorphic. Stephen H. Unger "GIT—A 
Heuristic Program for Testing Parts of Directed Line Graphs for Isomorphism,” in 
Comm. Assoc, ot Computing Machinery 7, 1, Jan. 1964, p. 27. 



1-5 Three nine-node graphs. Which 
pair, If any. Is Isomorphic? Stephen 
H. Unger— op. clt.. p. 27. 

<c) 

eight relationships or nodes and arcs, to use the terminology of graph 
theory. Arc the two systems isomorphic? 

It is difficult to tell even in this simple case, because there are 5! 
possible node pairings that could be considered, and, in general, til for the 
n-nodc graph. By enumeration or inspection, we can show that Figures 
1-ta and 1-lb arc Isomorphic by relabeling Figure l-4b as follows: 1 -*C, 
2 -* D, 3 -* E, 4 -» A, and 5 -»B. But this seeming ease in visualizing similar 
patterns disappears rapidly as the number of nodes increases. (Try to de- 
termine which pair of the nine-node graphs in Figure 1-5, if any, is 
isomorphic.) 

To be e\en more explicit, suppose we bad a computer available to 
enumerate and test our trials at the n-node graph problem, at one millisecond 
per trial. It would taVe about an hour to solve the ten-node problem, and 
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about 40 years to solve a 15-nodc problem by the obvious enumeration 
procedure. 

This short discourse on the complexity and size of combinatorial num- 
bers encountered in practice should convince us that the Erst step in defin- 
ing a system is to specify the purpose and restrictions under which we will 
create our abstraction or formal picture of a given case. Moreover, our 
specification must take into account our ability to handle the types of anal- 
ysis desired ( as illustrated in later chapters ) and should be as simple— in 
the sense of analytical efficiency— as our stated purpose will allow. 

Unfortunately, although the principle of specification and simplification 
is clear, its method of application is not. At the present state of the art no 
theory of system simplification can guarantee that we have abstracted our 
system from the real world correctly, nor that we have simplified our choice 
of elements and relationships in an optimum way for a given purpose. In 
this all-important area of system definition, which commits us to a course 
of investigation and expenditure of effort, we must Tely on ait, experience, 
intuition, hunch, and hypothesis. To make matters worse, no method can 
guarantee (in all cases) that one system description is equivalent to an- 
other; no general way of proving isomorphism is available today. 

Yet, our procedure for defining a system (or a problem to be solved) 
need not be a random process. We can organize our experience, classify our 
examples, and show which intuitions and hypotheses of selection have a 
greater chance of success for each class. We can provide a generalized 
framework for formulating systems descriptions. We can present a range 
of formats for systems definition that have wide application. And, we can 
cite a number of principles of systems simplification and a number of tech- 
niques for the investigator to use in combating system complexity and 
variety. We can also illustrate a number of general systems properties that 
can guide the definition in the specific case. 

Such heuristics 2 form the basis for system definition (and later analysis) 
by turning our approach from a search for “the optimum” or “an optimum 
system definition to One that, with high probability, iwJJ be suitable for our 
purposes. So, accepting this approach, we turn first to an over-all view of 
the system and its components as defined by a series of classical methods. 


THE SYSTEM AS A NETWORK OR FLOW DIAGRAM 

Consider a set of points that represent elements. (These points may be 
called nodes, components, operations, vertices , etc., depending on the 

"The word heuristic comes from the Greek hevrUketn, to discover, and pertains to 
methods serving to stimulate investigation, particularly those methods of demonstra- 
tion which tend to lead a person to investigate further by himself. Io mathematics and 
computer technology, heuristics take the form of rules of experience which aid in 
problem simplification and solution. 
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1.8 A system flow diagram used for definition. 


reader's preference.) Our original collection of elements contains those that 
may be of interest for a given purpose. For example, our elements in an 
industrial system may be work centers, machines, departments, or divisions, 
depending upon the amount of detail needed in later analyses. The ele- 
ments in a mechanical or electrical system may be physical components that 
perform a task and can be isolated. Clearly, some systems easily allow us to 
list their possible elements from physical, logical, organizational, or similar 
considerations. In other systems, the choice must be tentative. 

However, whatever our initial choice, we should assume that we have 
a finite set of elements around which we can draw a clear boundary, as in 
Figure 1-6. 

Next we investigate the relationships among the elements initially 
chosen and indicate these relationships by lines or arrows connecting the 
elements in question. (The relationships shown may be called transactions, 
interactions, transmissions, connections, links, arcs, edges, etc., again de- 
pending on the reader’s preference. ) 

Our choice of the relationships to be shown will be set by our purposes 
in defining the system. For example, if we intend to study the operation of 
an order-processing system, the transactions shown will be the flow of orders 
and the subsequent documents produced. In an electrical system, the con- 
nections may represent the wires in a circuit. In a logical system, the links 
will show the sequence of operations to be performed. Relationships not 
important to our purpose will be omitted within our system boundary, and 
only relationships that affect our system (or will affect other systems) will 
be shown entering or leaving the boundary, as in Figure 1-8. 
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INPUTS AND OUTPUTS 

Thus far, we have said nothing about measuring the relationships or about 
determining the element properties. For the moment, let us set this job 
aside and dwell upon the structure of the system as it now stands. Looking 
at Figure 1-6 in more detail, we see that we have done several things: ( 1 ) 
We defined a boundary that separated the elements we positively wanted 
from others we might have considered; (2) wo selected the elements within 
the boundary that we would consider further; and (3) we selected the re- 
lationships within the boundary that we would consider further, showing 
these few important relationships within the boundary, as well as to and 
from the “outside world” across the boundary’. We have thus far illustrated 
only undirected connections or relationships. 

If wc wished, however, we could indicate with arrowheads the direction 
of the relationships included or the direction of the effects produced by one 
element on another. Such a directed flow* diagram is often necessary to 
describe adequately the operation of the system. 

For example, let us adopt the convention that an arrow emerging from 
element A and entering element B will signify that A affects B, A precedes 
B, A initiates a transaction to B, or As output is an input to B, etc. Con- 
versely, an arrowhead entering an element, say B, will show that B is af- 
fected by A, B follows A, or B’s input is A’s output, etc. Such a convention 
is followed in Figure 1-6. 

This convention indicates that some arrows enter the system boundary, 
and others leave the system boundary. Those arrows that cross the bound- 
ary represent the inputs and outputs to and from the whole system. And, 
to study the over-all operation of the system further, our problem is to 
relate the system's output to the input entering the system. In the order- 
processing system, we will wish to determine what production orders, 
documents, or products leave the boundary when given types of orders 
enter. In the electrical system, we will want to know what action, motion, 
or signal is transmitted from the system in response to various input signals. 

In the logical system, we will wish, to know what results to expect when 
certain types of data or conditions are given as input conditions. 

In short, our later analysis of the system may be concerned with predict- 
ing the outputs from the system, given certain input conditions. To do this, 
we must isolate the inputs and outputs and determine the structure within 
the boundary that may affect our prediction. In other problems, we may 
be given specified input and output conditions for the system, and be asked 
to restructure the internal operations of the system to meet these specifica- 
tions most efficiently. In either problem, the simple flow diagram described 
thus far provides a starting definition. 
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SYSTEM 
TRANSFORMATION 
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1.7 The system Information. *hen summarized by combining internal system 
detail Ttus'is the "bfacK box" transformation o» the system input to the sys em o p . 
as described irt Chapter 2. 


MEASURING RELATIONSHIPS AND ELEMENT PROPERTIES 


At this point, we may ask for more information about the system elements 
and relationships. Generally, the relationship j of the flow diagram can be 
described by an attribute or a variable.' For example, in the order-process- 
ing system, we may want to know the type of document represented by 
the transaction (order, confirmation, release, file copy, etc.), or we may 
want to know some measure of the volume of traasactions per week, the 
time required for the transaction to be completed, the dollar value of die 
transaction, etc In the electrical circuit, the relationship may be a current 
flow, a magnetic coupling, or a similar physical measure. Indeed, a combina- 
tion of variables may be used to describe the relationships involved in as 
much detail as may be necessary for analysis. 

Similarly, the operation of the elements, often called their transforrna - 
tion properties, must be specified in sufficient detail to relate the internal 
inputs to the outputs. In particular, (he transformation performed by an 
element is usually described by the mathematical function, logical opera- 
tion, or process operation that relates predictively an element’s output to 
its input. For example, an order-processing element may convert an order 
into a credit inquiry, an electrical operation may amplify an input signal 
into a larger output sigpal, or a logical operation may compare one input 
signal with another and produce an output (or not), depending upon the 
results. If we know each element input and transformation, we can predict 
the element output. By extension, if the element transformation properties 
arc known, it is possible to compute a prediction of the system output. 

In summary, our notion of the system element can be expanded into the 
quantitative or logical relation of an input set of variables (or attributes) 
to an output set of variables (or attributes) by a transformation, as noted 
in Figure 1-7. 


Even where we do rot wish to measure relationships but only to indi- 
cate their presence or absence, a quantitative notation (1 or 0) is useful to 
indicate structure Similarly, where an element may not represent a pro- 
cess but sunply q decision point or similar reference point in time or space, 
a structural notation using th e (1. 0) convention is often useful for matrix 
and similar presentations. 


'TV* Am) of this description fa discussed la Chapter 3. 
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1*8 A system matrix, showing the same structure as the How diagram In Figure 1-6. 
The conventions used are described In the text. 


MATRIX DESCRIPTIONS OF THE SYSTEM 

Each element in Figure 1*6 has been numbered. If we adopt the conven- 
tion that the presence of an arrow will be indicated by a “l,” and its ab- 
sence by a blank, or “0," we can draw the table, or matrix, of Figure 1-8, 
which summarizes the structure of the system in compact form. 

Standard conventions are used in Figure 1-8. If the element in our sys- 
tem emitting an arrow is designated as i and the element receiving an ar- 
row is designated we number our matrix rows i = 1 , 2, 3 n, where n 

is the total number of elements within the system boundary plus those lew 
outside elements affected by the system outputs and those elements offer- 
ing inputs to the system. Similarly, we number our matrix columns j = 1, 2, 
3, . . . , n obtaining a square table. We understand a “I” in cell (i, /) means 
an arrow goes from element i to element j; a blank means no connection. 

The system matrix representation has some obvious advantages in pro- 
viding rapid information for the investigator. The total number of arrows 
leaving a given element can be obtained by noting the number of entries 
in the given element’s row. The total number of arrows entering an element 
is the total number of entries in that element s column. The element having 
the largest number of both entering and leaving arrows can be found by 
locating that element with the maximum number of entries in its row plus 
its column. The number of two-way transactions between elements can be 
found by isolating all cases for which there is an entry in both cell (i, y) 
and cell (/, i ) that uses the same values for i and j in each case. Similarly, 
elements that receive but do not transmit can be isolated by a simple row- 
column inspection. Other system properties (such as loops) can be ob- 
tained also by inspection of the matrix. So, for ease of display, as well as 
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1-9 Variations ot tho system matrix ol Figure 1-8 to Include additional detail on lire 
relationships between elements In the system, as described In text. 


for theoretical considerations, we will frequently refer to the system matrix 
and its variants. 

Notice also that the matrix display can be used to show measures or 
relationships in either attribute or variable form. Figure l-9a shows an 
attribute display that uses the same type of transaction as that used in 
Figure 1*8 in place of the ( 1, 0) convention For example, the type of trans- 
action is shown in Figure 1-8 by the letter code attached to each arrow, 
where A = order, D - inquiry, C = release, D = confirmation, and 
E = advice. The matrix in Figure l-9a now carries both structural and 
transaction information. Figure l-9b shows a similar display in which num- 
bers indicate the frequency with which transactions occur in a given time 
period; c.g., ten orders enter the system per day on the average. In general, 
the matrix entry may be a complex code, or vector, that carries as much detail 
as necessary on each transaction. 


A SHORT HISTORY OF SYSTEM DESCRIPTIONS 

The general description of the system definition just presented has an in- 
teresting historical base. Because of the selection problem (and its resulting 
combinatorial numbers), descriptions of systems (or models of operations) 
base often conformed to the view of science in vogue at the particular time.* 
In ancient Egypt, the picture of organfrational structure used most often 
referred to the physical structures found in nature or made by man. It is 

Mrf * "" So “ I SctaTO ” ™ c 
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not surprising that organization charts of that time (and since) have shown 
a pyramid of power, with the control centered at the apex of the struc- 
ture. Family •'trees" are a similar “iconic” representation of relationships, 
gained from observations of nature. These were static pictures, however, 
and indicated change only through the revision of the parts of the picture 
as time passed. No short-range variations or day-to-day change in relation- 
ships could be shown. 

With the industrial revolution and its emphasis on machinery and clock- 
work, the picture of a system changed from static to ordered and stable 
motion. The organization, or system, was conceived of as a group of gears 
and pinions. Different theories were set in this framework. The term “a big 
wheel” is still used to imply an organizational element who has many 
forceful relationships with others in liis system. The new concept was 
stable movement; that is, movement which is totally predictable and sub- 
ject to calculation. This was the time not only of wonderful machines but 
also of great interest in accurate prediction and a feeling that the world 
was deterministic. 

The gear-theory view of the world, however, had serious {imitations, 
too. No explanations could show how the unpredictable events (which 
everyone observed) came to pass. There were also no suitable systems de- 
scriptions explaining the strange delays in the scheduled train of events 
predicted by the deterministic theory. 

Harvey's discovery of the correct circulation system of blood in the 
body had its profound effect on the history of system formulation. He 
focused attention on what might be called a “hydraulic concept of ex- 
planation.” Harvey showed that many phenomena could be viewed in terms 
of “flows”— something roughly related to a plumbing system. This view not 
only fit a number of medical explanations, but also was general 
enough to handle explanations of the emerging sciences of electronics and 
hydraulics. Indeed, we still are indebted to this era for many of our con- 
cepts of flow diagrams and charts. 

By an extension of the flow-view of operations to include storage and 
delay, thinkers such as Freud presented a picture of the world that included 
even more possibilities. Here the system had the ability to store responses 
and reflect earlier stimuli in reasonably unpredictable responses at a later 
date. The ability of the system to exhibit variety was enhanced. Freud, 
however, retained some of Harveys strict hydraulic principles: He reasoned 
that inputs could be stored, but, if they were “repressed” beyond the ability 
of the human system, they would exhibit themselves in another, often per- 
verse, form at a later date. This view was still relatively rigid, and for 
prediction left much to be desired. 

Our present socio-psychology of systems definition— the conditioned 
way we select elements and relations we wish to consider— borrows much 
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from this historical background. However, today we usually tend toward 
a world-view largely flavored by recent technical developments in science 
and computation. 

We have come to see that nature holds an inherent element of uncer- 
tainty and chance in most cases of any interest. The act of measurement 
rtself (or an attempt at it) often alters the situation, as Heisenberg’s fa- 
mous principle of uncertainty in physics has shown. Moreover, our knowl- 
edge of exotic devices, electronic computers, and control systems tells us 
that much of our present technology is geared to the construction and op- 
eration of purposeful or goal-seeking ( teleological ) systems. 

In this “cybernetic" 5 view of the world, systems can have still greater 
variety of response to input, although many of the old views of organiza- 
tions and systems remain. If the purposeful system, however, is described 
in a flow diagram, or even a more deterministic form, and yet is imbued 
with the ability to remember, store, and recombine facts-and to any extent 
to control its own future-a new range of description and system definitions 
is opened to us that w as not possible before. 
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C. Comment on (lie observer ’* ability to handle this variety of possibilities. 

1.3 A system is composed of five nodes and four relationships, or arcs. Assuming 
no node is left unconnected, how many different graphs could be drawn if 

A. The nodes are numbered 1, 2, 3, 4, and 5, and the arc directions must 
always progress from a smaller to a larger number. 

B. As in A, but now let the directed relationships proceed in either direction. 

C. In A, how many possibilities are there if the numbered nodes can be 
interchanged at will? 

1.4 A frog sits on a lily pad in a lake. He faces two rows of five lily pads each. 
When the spirit moves Iiim, he jumps to one of the five pads in the first row, 
and then again from where lie is to the next row ahead. 

A. How many paths could the frog take in mosing forward? 

B. How many paths would be possible if the frog faced AT rows of r pads 
each, and if he progressed one row at each jump? 

C. After the second jump in A, suppose the frog could jump both backward 
and forward from row to row. How many paths could he generate in 
five moves? 

D. In C, how many patlis could the frog generate if he could never jump 
to the same pad more than once? 

1.5 We can describe a system by the transactions between operations, or ele- 
ments, as shown in the table below. The value of each transaction is also 
shown. 

A. Draw a flow diagram of this system. 

B. Complete the equivalent system matrix to show all the information in A. 


fhom to Value 

12 6 
2 4 2 

2 3 3 

3 6 4 

4 5 10 

4 6 7 

5 7 3 

6 7 5 


1.6 Subassembly A is composed of one part l’s, one part 2 s, and three part 3 s. 
Subassembly B is composed of two part l's, bo part 2’s, and six part 3 s. 

A- Draw a diagram showing this relationship of parts to subassemblies. 

B. If we need ten subassemblies A and 20 subassemblies B, how many parts 
of each type will we need? 
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C Suppose mo A’s and tinea B’s go into final product I, and one A and 
lour B's go into final product II. How many parts oi each fcnd will be re- 
quired in total to make 100 Is and 200 ITs? 

17 Diagram the sequence of events you follow after arising in the morning 
until the time you reach work (or school). Do you see any repetitive, 
circular, or conditional loops in your diagram? Indicate what you do by 
using a box for an element and showing the sequence of events by arrows. 

1.8 According to Louis Nizer in My Life in Court, a useful technique, he em- 
ploys is to draw connecting bnes with different colored pencils between 
related statements m testimony to check inconsistencies, omissions, and 
plausibility of a group of statements. \See My Life in Court, Doubleday, 
New York, 1981, Pyramid, New York, 1963, pp. 114 — 115. j Could you 
suggest a method of organizing this marked typescript into a table or chart? 

1.9 Make a further study of the history of models and methods of system 
definition by reference to the articles by K. \V. Deutsch, listed in the 
bibliography. Also see the articles by N. Wiener and A. Rosenblueth and 
N. Wiener These references provide a good background for the material 
that follows. 


PROJECT 1 

Read Appendix E, the memorandum describing how to study your hometown. 
Perform such a study of your hometown, or, using similar methods, perform a 
study to define the important relationships in your school or company. The 
purpose of this project is to illustrate the usefulness of a wide range of materials 
in defining an organization or system. 
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DETAILED METHODS 
OF SYSTEM DEFINITION: 
THE BLOCK DIAGRAM 


Figure 2-1 shows three building blocks that define a system in more detail. 

The blocks shown indicate the three major system operation types. Fig- 
ure 2-la represents a single transformation, or conversion operation. Figure 
2-lb represents the application of a decision rule, or a logical operation. 
Figure 2-lc represents an observation phase, or correction operation, in 
which an operation's output is fed back for comparison with an operat- 
ing standard. Several transformations and logical operations can be com- 
bined to produce (or to describe) a more complex system; the feedback 
control principle can be applied to groups of transformations or logical 
operations. The versatility of each of these three basic operations can be en- 
hanced by the use of “memory.” 

In this chapter we outline methods for improving our understanding of 
these basic operations and illustrate how such operations can be combined. 
We will see the block diagram approach in this chapter, but in Chapter 3 
we will describe alternate methods that are useful in special cases. 


THE TRANSFORMATION OPERATION 

To improve the definition of a system over a structural description, we ask 
for the details of component operation. For example, if an order is received 
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2*1 Threw bul’ding blocks that can b« used to construct complex systems, (a) Is the 
trars'om-ailon block that converts an Input t Into an output y by stated conversion 
lutes, ttsns'ormatlon block may also be used to Indicate a sequence ot steps In a 
process, (b) Chows two lorms commonty used to Indicate a decision block; the Input X 
ki tested, and produces either action y or action r (c) Shows tho common feedback 
cpers'ion, wTWn the dotted tines, the input x Is modified as a function of the ouput y. 
An ana*yvs of the feedback block Is tett to Figure 2-10. 

and an accounting re-port is issued, we may ash: "What was done to the 
order to obtain the accounting report?" We inquire into the details of the 
conversion. TVv- steps may be simple, or complex, but if the transforma- 
tion rules can lx- determined, sse can predict what will happen to an input. 

Some examples of tV conversion operation arc: 

1. The order department receives an order, extends the items ordered 
by unit price and totals to obtain lb** total order value. This total goes 
to the accounting department for a credit cheet. 

Z IV coot freshes 100 pounds of flour, 50 pounds of sugar, five dorm 
eggs, and trn pounds of butter eath day. From ibis he males ten 
cilet and ten loaves of Vrad each day. 
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3. The component amplifies an electronic signal 10,000 times when the 
input signal is 2000 cycles per second. 

4. The operation is to find the output y from the input x, where 
tj = ax 1 + bx + c. 

The description of the transformation may be more or less detailed, 
stated in mathematics or in English, but the purpose of specifying what 
conversion takes place remains the same: Given the stated inputs, we must 
know enough to describe what kind of output will occur. 

The Black Box Concept 

Note that in the above examples we said nothing about exactly how the 
conversion was to be made. The accounting report could be produced by 
hand or by a computer, by a single individual or by a group. The cook 
could bake his cakes and bread in a gas or an electric oven. The electronic 
amplifier could be made of tubes, transistors, or some other components. 
Finally, the mathematical operation, the most general description, could 
be performed in various ways: by hand, by an analog computer or digital 
computer, or by any physical operation one could describe by a quadratic 
equation, say, the position of an object after a time x. 

Thus, we can think of the transformation as a black box, which repre- 
sents a grouping of detail. The detail is so grouped either because we do 
not choose to deal with further detail, or because we are unable to pene- 
trate the black box boundary. 

Example: If we wished to know in complete detail why a particular dog 
wags his tail, we would need to destroy the dog to trace his nerve struc- 
ture, to cite one form of analysis. This, however, would still not explain 
in complete detail why the dog wagged his tail. For the dogs owner, a 
black box approach is entirely satisfactory: “My dog wags his tail when 
I give him a bone.” 

How We Define a Black Box 

A black box, as illustrated in the dog example, is really a system of infinite 
detail that encompasses the atomic structure of the dog’s protein molecules 
and beyond. But, because of our restricted purpose— simply to make the 
dog’s tail wag— we treat the dog as a block like Figure 2-la. In other words, 
at some level of refinement, we say “enough!” and lump together what we 
do not know or care to discover. 

The definition of a black box, then, is “operational” or “behavioristic." 

We do not ask in ultimate detail how a transformation takes place, but 
rather observe what does take place. If the input-output relationship ob- 
served is sufficiently stable to offer reliable prediction, then that reliable 
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relationship is taken as the Mack box trensfatmtiots. The pterion Fora 
suitable black box definition is the same as that for a system definition. The 
output must be predictable, within the limits of our need, from the input. 

The term black box, incidentally, originated in electrical engineering 
where many physical black boxes contained electronic components. (Black 
was the popular color for instrument panels before and during World War 
II.) It became convenient to provide electronic boxes with input-output 
specifications, connection instructions, and performance tests that did not 
require dismantling the hardware, not n the term and the method o f . toeatinfe 
components solely by their input-output relationships have remained and 
have spread to other fields. A. similar trend has occuned in science, but for 
a different reason. “Operationahsm," in physics, and ‘behaviorism,’' the 
corresponding approach in the social sciences, attempted to free science 
from ambiguous terms and from explanations that invoked supernatural 
beings to rescue the analyst from apparently mystical phenomena. It became 
popular to say that if the term could not be measured, it was undefined. Or, 
in our case, if no predictable relationship could be measured, the black box 
transformation could not be defined.* We shall adopt this view, indicating, 
where necessary, criticisms of the approach. 


How V/o Develop the Transformation 

Blackbox transformations come to out knowledge in three ways: 

By specification 

If the operation is well-known, say it is a machine for a known opera- 
tion, then the specifications for operation may have been developed by 
others. In this case we would be fortunate to have catalogues, where we 
could find our component transforms at the turn of a page. At this level, the 
component block is clearly defined as is the transformation. 


By analogy, similarity, and modification 

a>o„!d (he system components be less obviously circumscribed, it may 
yet be possible to notice processes o( a iamilia, type-common inventory, 
bottleneck, allocator,, or search processes to list a few business examples. 
Here we may be guided by existing theoretical descriptions, or speciEca- 
tiom for a class of processes. Although the transformations suggested by the 

’ W b h™a' b S , ,lS^'Ti sV ‘ '? * “sloriol note on tie tern, 

leypttavicY, which wis. wisttert The W ll v t " t0 , contaIn the proof* of his 

Wvn and unknowable." S for v ' hose Content* are quite un- 
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more general theory' may not be suitable for a particular block, the theory 
can often be modified suitably with less effort than if we start from scratch. 
Thus, if a block transformation generally involves the resolution of input- 
output priority problems, we may find assistance in waiting-line, or queuing, 
theory, and by modification of a genera! formula or “model” develop the 
particular transformation we need. 

Observation and experiment 

If we know nothing about the black box we have no choice but to ex- 
periment. This is done by observing various combinations of inputs and 
outputs, by recording them in order, and by attempting to deduce what 
relationship may adequately describe what we see. 

Example: Sometimes we arc totally ignorant of what a black box opera- 
tion does, or in fact, which connections are inputs and which are outputs. 
In this case the combinations possible for testing inputs will not allow 
us to learn much about the box unless we are lucky and select an input 
for trial that produces a useful or interesting output. More likely we 
may know something about the box, its general class of performance, 
and the material we want to measure. To return to the order-process- 
ing example, we may not know what the department does exactly, but 
we can get a good idea by sending in a trial order and capturing the 
documents that relate to our test. With a few such tests orders, we can 
narrow the range of usual actions to those needed for prediction. 

Experimental tests may also be necessary to confirm the black box trans- 
formations developed from theoretical considerations. Obviously, the more 
we know about the black box operation, the more accurately we can select 
experimental inputs, so any knowledge of possible theory is an aid in experi- 
mental determination of a black box transformation. 

Black Box Assumptions 

Let us now suppose we have attained a satisfactory black box transforma- 
tion. What assumptions have been made? And, what assumptions must we 
make when combining black boxes? Three considerations are important: 

Continued stability of transformation 

The behavioral or operational definition of a black box (which rests only 
on observations of input-output relationships) assumes that, at least for 
predictions, the box must operate in the future as it has in past recorded 
observations. In short, we assume the black box has a fixed memory, that it 
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cannot leam, and, specifically, that its internal operations will not change 
with time or strange combinations of input conditions we have not con- 
sidered in our experimental trials. 


Example: The classic example is a land mine set to go off after N jolts. 
The experimenter who applies N —l jolts may draw a deadly conclu- 
sion from his observations. He will surely be surprised when he makes 
the Nth test. The faithful cashier who after thirty years of trust and duty 
purloins the bank deposits represents the human analog. Metal beams 
have a similar ability to change with time and a variety of cumulative 
or fortuitous inputs: The aircraft engineer or bridge builder may 
treat structural members as black boxes when computing working loads 
and stresses, but even metal can fatigue, crystallize, and crack. 


We are, of necessity, driven to black box thinking in the affairs of every- 
day life-even if we had no scientific theory of operationalism. We sit on a 
chair svuhout thinking of its atomic structure, and make deposits and 
withdrawals at the bank without concern about the moral habits of the 
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because the boxes are not independent: Coupling the two boxes produces 
a change in the individual transformations. 

For example, if the second box “loads” the first, or vice versa, the 
operation may be altered. Such effects are common in physical measurement 
situations in which the measuring instrument when attached to the device 
under measurement actually changes its transformation. Investigations of 
the stock market often change the market. The job may exceed the capacity 
of the worker. One machine may jam the operation of another. 

Dependence, of course, is not always bad. Two carpenters working 
together can do the work of three or more men working alone, because 
one man can cut and hold while the other levels and nails. The division 
of labor gains its economies from the change in individual transformations, 
or methods of work, which are possible when individuals work as organized 
groups— a result we would not see from a straightforward combination of 
individual black box transformations. 

When we combine black boxes, we may find “emerging” variables that 
are not apparent from examination of the individual parts. Chemical 
examples abound: Carbon, oxygen, and hydrogen have no particular taste 
in themselves, but in combination they can produce the sweet taste of 
sugar. In business, two departments when combined or related in a given 
sequence may produce emerging variables, good or bad, such as new 
product ideas, a union, or a new bowling team. 

Again, for the combination of effects noted in this section, we might 
find an explanation if the original black boxes had been examined in greater 
detail. There is little doubt in the minds of most scientists that if they 
completely understood the amino acids that make up living protein, they 
could explain and synthesize living matter. 

Compatibility of outputs to inputs 

If we sav that the output of one black box is compatible with the input 
requirements of the second, we mean that such an output is "suitable” for 
transformation by the second black box. The second black box, for example, 
must be able to handle the variety of the first box's output unless some 
loss of performance (and therefore some change in the over-all transforma- 
tion of the two boxes ) is not to result. We can only fill orders for items 
on our product list, unless we make some change in our transformation. 
The customer’s output (his order) must be compatible with our input 
requirements (product line). We shall discuss this problem further in 
Chapter 6. 

Combining Black Box Transformations 

The method for combining stable, independent, and compatible black boxes 
is straightforward. The coupling between the blocks is specified, so that a 
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consistent transfer of outputs to inputs is made. The result is » ktock 
diagram with a starting point for inputs) and a terminating point { ot 
outputs). If we ha\e made the combination process in this way, the re- 
quired couplings and connections will also be complete; i.e., we will have 
no missing links. The original structure that wt* observed in the flow 
diagram and matrix of Chapter 1 provides this connection information. 
The arrows m such a block flow diagram show the cartahtc* that connect 
one operation with another, the operations, or Mock transformations may 
be inserted when the arrows connect, i.e., where the elements were shown 
tn Chapter 1. 

The rcsultingblock diagram shows-. 

1 The inputs to the system, 

2 The outputs from the system, 

3 The exact sequence of operations that take place between these 
terminals, and 

4 The transformation of variables that takes place at each element, or 
operation. 

Such a block diagram, may itself be a sutfiefent system description. This 
is so because, given a particular set of inputs, it is now possible to trace 
through the block, diagram and compute the outputs. A computer flow 
diagram is no more than a block flow diagram. (See Figure 2-2.) In what 
follows, however, we observe a special definition which is often useful, the 
transfer function of Figure 2-2a 
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The transfer function 

If the system has a single input and a single output, we find it con- 
venient to reduce die over-all flow' diagram to an equivalent black box. 

\\ e can do this most easily by defining the black box transfer function 
(from the transformation) as the ratio of the black box output to its input. 
Example: If y = kr is the number of dollars of an item ordered in 
quantity x, when the unit price is k, i.e., a transformation of units to 
dollars, then the cqtmalcnt transfer function tjfx = k (dollars/unit) is 
used to describe the black box- 

Rules for black box combination 

If we follow the definition of the transfer function for a black box, we 
obtain the output of a black box by multiplying its input by its transfer 
function. The same principle applies, by extension, to combinations of black 
boxes. See Figure 2-3. 

1. Multiplication rule: If two blocks operate in senes, we multiply the 
individual transfer functions of the boxes to obtain the equivalent oxer-all 
transfer function for the nexv system The same rule applies to more than 
two black boxes in senes ( Figure 2-3a). 

Example: Black box A has the transfer function tj/x = K,. Black box B 
has the transfer function z/y = K,. Then, the over all transfer function 
of both boxes is 

= — — KjKi 

x 

and the system may now be described by the single black box having the 
above transfer function. 

2. Addition rule: When two black boxes operate in parallel, an addition 
or subtraction of variables will be required to obtain the over-all transfer 
function, which is defined for the ratio of output/input (Figure 2-3b). 

Example: An input variable x is split into two equal signals to be 
operated upon separately and then recombined by addition to get an 
output variable. The situation is shown in Figure 2-3b. Here we follow 
the convention that the variable on all solid connecting lines will be 
the same, but that if different variables are to be combined, the summing 
point box is required. The combination details are shown in the figure. 
Note that the result shown is equivalent to addition of the parallel black 
box transforms at the summation point. 

3. Transfer functions of selected output! input pairs: Although the classi- 
cal black box transfer function is defined only for a single black box 
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tain a given output of many dimensions, or attributes, matrix and vector 
notation may be used. In such instances the above rules for combination 
stiU hold, but die rules for matrix multiplication and addition must be 
used. 

Example: A black box describes a cost accounting operation in which 
three parts with costs Xi, x*, and x 3 , respectively, are to be combined 
into two assemblies, with total costs y, and y r , respectively. The table 
showing the required combination of parts in each assembly is the black 
box transform in matrix format, where the entries in column 1 are the 
numbers of parts of type 1, 2, and 3 required in assembly 1, etc. As 
shown in Figure 2-4, the value of each finished assembly is the output 
vector [t/i = 11, i ^ = 17J given by multiplying the input vector by the 
matrix transform (in that order). 

For a summary of matrix operations, see Appendix A. The rules for 
matrix operations arc an extension of the rules given above, but allow great 
simplification in system notation, and therefore should be familiar to the 
reader. 


Transformation with Time Delay 

In our discussion of black box transformations in the previous sections, we 
assumed that the transformation proceeds without delay. This resulted in a 


Vector x 


Matrix T 


Vector/ 


Part 

Type 

12 3 

Cost Each 3 4 1 

Vector x 


Part Product 
Type Type 

Lab 

:nr 


X 2 2 3 


3 0 2 


Matrix 7 


Product 
Type 
A B 


11 


17 


Cost Each 


Vector y 


2-4 Transformation of vector inputs. An input variable x may consist of several 
classifications, or have several attributes. In this case, the Input is a vector, as 
Illustrated by cost Information for three part types. The conversion operation called 
for is to transform part cost information Into product cost information, as specified 
by the matrix, or table, which gives the number of parts of each kind In each 
product made. Multiplication of the vector x by the matrix 7 {in that order) gives the 
required result, the vector y with components representing the unit cost of each 
product. The operation of matrix multiplication is described In Appendix A. 
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simple algebraic treatment However, most transformations of interest do 
show a delay: It takes time to get a given job done. When time is a factor, 
we cannot directly employ the algebraic methods of this section to define 
and combine black box transfer functions. However, the approach is per- 
fectly general: By suitable application of the methods of Appendix B, it is 
possible to reduce the time-dependent case for a linear system to a simple 
algebraic equivalent, which may be treated by the methods discussed. 
Moreover, if the transformation is to be applied periodically, rather than 
continuously (or if a numerical approximation can be used) it is also possi- 
ble to use the numerical procedure given on p. 43. We shall leave further 
discussion of the method to that section. 


In preparation for later chapters, we should also note that any time delay 
m transforming an input variable into an output variable implies the black 
box exhibits some -memory.- If the order-processing department receives 
an order and must transform units-ordered into total price by a unit-price 
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2. If you have not finished the assigned task, continue to work until you 
have. 

3. If the percent of defective product is greater than 5%, use plan B; 
otherwise use plan A. 

4. When the clock reads exactly 01:36, detonate the bomb. 

We see that a choice is made between one route of action and another. 
The decision box makes a simple choice, called a conditional branch in 
computer flow diagrams. 

Loops 

Figure 2-5 shows a special case of the decision box application that is 
particularly important in computer programing and machine design. It 
represents the compute-tally-tcst sequence, which can multiply the effec- 
tiveness of simple instructions. 

Example: Suppose we have 100 numbers to total. Rather than give 100 
instructions to the system to add, requiring 100 separate transformations, 
we give one add instruction, make the first addition, tally ( increment or 
decrement a counter), and test to see if we have added 100 numbers. 
If so, we stop; if not, we loop back to the add instruction and repeat the 
sequence until 100 numbers have been added. Obviously, we can test 
for any given number of cycles that we may care to store in a memory 
device, and so the method is perfectly general. 

In other cases, we may compute and test against some criteria stored 
in a memory device, then stop when the criteria have been met. Thus, we 
may continue to add until the sum of all numbers accumulated reaches 5000. 
The criteria can also change with time. 

Combinations of counters, tests, and operations can be made. The use 
of various memory devices for storing test criteria and for performing opera- 
tions to change test criteria will increase greatly the versatility of the system, 
as we shall see further in Chapter 4 and 5. 

Decision Trees and Multiway Branches 

The decision box can also be used in multiple sequences to provide alternate 
decisions for some input conditions. Sorting, classification, and ordering 
operations require such combinations of steps. 

Example: The input to a system can be 1, 2, 3, or 4. We wish to use plan 
A if the input is 1, plan B for 2, plan C for 3, and plan D for 4. Figure 2-6, 
where the input is x, shows a tree that will make the required 
discrimination. 
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2-5 Combining operations and tests In a loop. When a system Is to perform repetitive 
steps, it Is not necessary to repeat the Instructions for completing the total sequence 
of operations. The required number of operations may be obtained by a loop-and-test 
sequence which Is ol great Importance In simplification of system design. Two typical 
loop-and-test sequences are Illustrated, (a) A system that counts. The operation stops 
after 100 computations have been made, (b) A system that tests a given value In 
memory The operation stops when the result of the computations produces an ac- 
cumulated total greater than or equal to 5000. The tests terminating a loop may b® 
as complicated as necessary for a given job. 


Decision trees and transformation boxes can also be combined to pro- 
duce many useful systems. 

Exempli: A vending machine sells soft drinlcs for 15?. The machine will 
accept quarters, dimes, and nickels, but not pennies. The slot is too sin all 
for half dollars. Draw a logic diagram using simple two-way decision 
boxes to show a possible design for such a coin device. Figure 2-7 shows 
the result, but other diagrams may achieve the same result and provide 
side benefits as well. Trace what happens in the logic diagram for various 
combinations of nickels, dimes, and quarters. Note that transformation 
boxes and logic boxes have been combined in the diagram. 







The Correction Operation: The Feedback Box 


41 



2-6 A multiway branch lor logical selection or matching. The multiway branch or 
"tree of decisions” shown will accomplish the required matching of plan selected to 
Input condition. In many logical systems, extensive branching of this type Is used. 
The addition of looping, as Illustrated In Figure 2-5, greatly extends the usefulness 
of this device. 

Combining Logical Operations 

Given a flow diagram showing a number of black box transformations and 
logical operations, we see that it is usually inconvenient or undesirable to 
reduce the diagram to an equivalent black box. 

It may, however, be desirable for us to summarize the logical tests that 
must be passed by an input to produce a given output, as in complex sorting 
operations. Decision Tables are convenient display forms for doing this. 

Decision tables 

A tabular summary of the tests in a system, the rules for reaching a 
decision, and the action to be taken in each alternative may be useful in 
checking or describing the operation of the system. Figure 2-8 shows such 
a table for the coin-changer example. Decision tables are also valuable in 
data analysis work to describe the action taken by the system for a list of 
possible inputs; or, the decision table may tell what inputs to give a system 
to get a desired result. 

THE CORRECTION OPERATION: THE FEEDBACK BOX 

The third major type of block is a comparison and control box, called the 
feedback block. Again, this block may be constructed by an appropriate 
specialization of the transformation block, or transformation and logic blocks, 
but because of its importance, it is convenient to establish the distinction. 
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2-7 Logical flow diagram of a vending machine that accepts nickels, dimes, and 
quarters. The product cost Is 15$. The diagram assumes the machine la not empty 
and that adequate change Is available. No coin return is shown. 


Feedback 

The general principle of feedback was depicted in Figure 2-lc. Basically, 
an input to the system is received and operated on by a transformation 
Hock. The results of the transformation ate then monitored, and by suitable 
procedures the output is compared to the input standard. Any mors are 
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2-8 A decision table for the vending machine of Figure 2.7. The specific tests con- 
ducted are shown with the specific actions to be taken for each test combination made 
by the machine. Ten specific actions are taken by the machine. Several tests in com- 
bination produce the twelve columns of the table. Y stands for yes, N for no, and — 
indicates either yes or no. X means the action indicated Is to be taken. The test com- 
binations proceed from 1-12, as may be seen in the flow chart. 


noted, and the error is then used by suitable procedures to correct the out- 
put on the following cycle. Such feedback may be continuous or periodic. 
It may reduce the error or increase it, depending upon the design of the 
system and its purpose. Feedback may be applied to transformation or 
logical blocks; but the operation of the feedback principle always requires 
a "loop’ of action, as in Figure 2-lc. 

Example: The classic example of a physical system that uses feedback 
is the “ball governor” invented by James Watt to control the speed of 
steam engines. (See Figure 2-9.) In this device the output speed of an 
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2-9 An early feedback device, the steam governor of James Watt. The action ol thl 
device is described in detail In the text. Simply stated, as the steam engine goes taster, 
the action of the device 13 to reduce the steam input, thereby slowing the 
The purpose of the device is to maintain constant engine speed under varying loao 
conditions 

engine rotates the shaft A, which causes the balls B to spread, raising 
the floating collar C and the weight D. The weight D provides a down- 
ward force to counter the lifting force provided by the governor as the 
balls spread. The equilibrium speed of the engine can be selected by 
changing the size of the weight D or by moving its position on the lever 
FC. Should the engine run too fast, the weight D is lifted, transmitting 
a movement to steam valve F (via the fixed pivot E), to shut off the 
steam and slow the engine. If the steam engine should run slower than 
desired (say svhen a load is applied), the process is r evened, the steam 
valve Opened, and the speed increased. The thermostat is another classic 
example. Inventory control, forecasting, quality control, inspection, and 
similar management control procedures require the application of 
feedhack. 

It is impossible to discuss in this introduction all the variations of the 
feedback principle or its many applications. Consequently, in what follows 
we will illustrate the basic principles of application and denote the three 
factors which are always important in feedback systems: (I) the effective 
‘‘gain" of the system, or its sensitivity, (2) the amount of feedback used, (3) 
the sign of the error correction applied, which is most often influenced by 
the lag or time delay between the time an output is measured and the time 
corrective action is applied. 
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Feedback Without Lag 

To avoid complications at the outset, let us first assume that we have a 
continuous feedback system, as shown in Figure 2-10 in which no lag or 
time delay is introduced by the transformation box K or the feedback box 
b. This assumption allows a simple algebraic treatment of the problem. 

Negative feedback, no tag 

If part of the output from the black box K (b in the figure) can be ad- 
justed to be negative when the input to the system is positive (and vice 
versa), we have the basic case of the feedback principle in which negative 
feedback is applied. Under the ideal conditions of no lag or time delay in 
applying corrective action, and with the negative or counteracting action, 
negative feedback will stabilize the operation of the system; i.e., the output 
of the system can be held relatively constant even though various external 
disturbances may tend to alter the transfer function K. 

Equivalent black box transfer function for a feedback loop 

The result cited above can be seen by reference to the equations de- 
veloped in Figure 2-10, where the equivalent transfer function for a feed- 



: x - 1 + bK 


2-10 A block diagram of a simple feedback device showing the modification In 
system transfer function resulting from feedback. The input x is modified by a 
portion of the output (-by), so that the output with feedback is also modified as 
shown. Without feedback, provided by the fower bfock, the output would be y—Kx. 
With feedback as shown, the output is y=K{x — by). The transfer function for the 
system enclosed in dotted lines is 7= (y/x) = K/(f + bK). The effect is to hold the 
output constant If the transfer function K of the upper box should change due to 
foad or other environmental conditions. This illustration assumes that the process 
proceeds without delay. The system with delay is discussed In the text. 
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back loop is developed in standard form. In the derivation, an input x 
arrives, and without feedback would be transformed by the transfer func- 
tion K into an output tj = Kx, i.e., if the box b were omitted from the 
diagram or temporarily disconnected But, with negative feedback (which 
is assumed here), some portion of y, namely, by, is fed back with a sign 
opposite to that of x. The transfer function of the forward path is K; the 
transfer function of the backward path is — b When combined, they yield 
the equivalent transfer function for the combination; i.e., y/x = fC/( 1 + bK), 
which follows from 3n algebraic rearrangement of terms. 

Thus, the eqimalent transfer function of a feedback system can be found 
by Erst finding die transfer function of the forward loop and the transfer 
function of the backward loop separately, then combining them by the 
standard method illustrated. By applying the methods given on p. 35, 
the forward and backward loops may consist of a number of black boxes, 
which can be reduced to the standard arrangement shown in Figure 2-10. 


Example: To indicate the effects of negative feedback in protecting the 
output-input ratio y/x from possible changes in the forward trans- 
formation due to external disturbances, we now refer to formula (2) 

J l m V i u aSSUme that K and b arc instants, say 

100 and b - 0.09. Then, by substitution, y/x = 100/(1 + 9) = 10. 
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to M goZZ the? L m: ‘° ,e rE!ul1 10 l! >' operation of 
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We should note that positive feedback is not necessarily harmful. Com- 
pound interest is one beneficial form of positive feedback, as are most re- 
investment operations. Physical devices, such as the oscillators used in radio 
transmitters and receivers, are other applications of positive feedback. 
Indeed, if the amount of positive feedback is carefully limited so that the 
denominator of the feedback equation does not approach zero, the "ampli- 
fication" effects of positive feedback in increasing the effective value of K 
can be beneficial. (As an exercise the reader should, however, investigate 
the effect of positive feedback on regulation, as illustrated in the previous 
example.) 

Feedback with Lag 

Our introduction to the feedback principle has been greatly simplified by 
eliminating time delay. Although many devices may be constructed to use 
feedback without lag, the usual system takes time to perform an operation, 
or to perform a correction, thereby introducing a delay or lag in using 
corrective action. 

When we introduce lag, the subject of feedback becomes enormously 
complicated. Even a survey discussion of feedback with lag is beyond our 
scope. Appendix B gives a brief outline of some common methods of treat- 
ment. Using the methods of Appendix B ( Laplace transforms and z trans- 
forms), we could handle combinations of black boxes and feedbox loops, 
and reduce a complex diagram to an equivalent black box when lag is 
present. However, we will now restrict ourselves to an approach that is 
suitable for our needs, but which requires only simple tabulations. We can 
do this by concentrating our attention on periodic feedback correction and 
by using a numerical approach. 

Our objective in the next two sections is to show that when time delay 
is present in the application of corrective feedback, the character of the input 
variable, or signal, to the system is critical. The examples we selected for 
this purpose are specific to our immediate needs but we should note that 
the tabular, or numerical, method employed is generally useful for a wide 
range of business problems, for evaluation by electronic computers, and in 
general, for simulation purposes. Although it is not possible to develop 
transfer functions for the feedback equations used here (as in the previous 
sections), we may develop a block diagram, show the required steps in a box, 
and understand that we must sequentially apply the operations indicated. 

The effect of changing lag for a given feedback system input 

First, we try to illustrate how a change in time delay, or lag, can change 
the results of a feedback operation for a given input. A rule for production 
planning will illustrate the point 
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Example: Let the sales for week n be x„, as in Figure 2*11. This record 
shows a periodic cycle, because sales fluctuate from 10 to 20 and back 
to 10 again, etc. To smooth production, we have devised a rule that 
tells us that this week we must produce an amount y„ equal to 502 of 
x* our actual sales requirements for this week’s production, which we 
know, plus 502 of y,-i, equal to last week’s production. This decision 
rule can be written as an equation: 

y» *= Q.5x„ + 05y„-j case I 

Suppose someone suggests that, instead of using last week’s production 
in the formula, we use the value from two periods ago, so that a second 
decision rule might be 


y» — 05x« + 05y,.j case K 

We begin the computation by assuming that production for all periods 
previous to n — 1 has heen equal to zero. Then, by successive substitu- 
tion of values. we can obtain the results shown in Figure 2-11. 

We should notice several things in the table. With lag a feedback 
ST n ‘!i? ausK tlie 'y*'™ ° ul P u t to follow a "trajectory* of values 
with time. This trajectory describes the dynamic response of the system 


*■*’ A "twose comparison or 
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two feedback decision 


rules as defined In the text 
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to the stated input. Case I eventually assumes successive values of 13.33 
and 16.67, whereas Case II assumes successive values of 10 and 20, 
matching the input. However, previous to the 13th to 15th interval, the 
increasing response values represent a “transient" condition while the 
system is making the adjustment from no sales to the alternating values 
of 10 and 20. The final condition is frequently called the steady-state or 
equilibrium condition. 

We see from Figure 2-11 that both the transient and steady-state 
conditions of the system may be changed by a change in the amount of 
lag used in the decision rule. The desired smoothing effect is obtained 
in Case I and not in Case II. Moreover, we see that the output is de- 
termined not only by the lag and the constants in the equation, but also 
by the sequence of inputs. What would happen in Case II if the input 
were: 10, 10, 20, 20, 10, 10, 20, 20, ... ? 


The effect of a selected Input on an unstable feedback system, fixed lag 

In the above example we found that both average production rates were 
satisfactory from one point of view. Average production equaled average 
sales once the system reached the steady state; in addition, neither output 
variation exceeded input variation. 

However, we could well consider a system unstable under another input, 
even with a fixed delay. We could do this by selecting a decision rule that 
can exaggerate error, should the combination of input conditions and delay 
create' a positive reinforcement of the input. Moreover, in the example 
selected, we will see that the lag and the choice of transformation and 
feedback values can cause sustained oscillations in the system. 

Example: Consider a decision rule, with definitions as in the previous 
example, but with weights chosen so that 

y„ = 2(x„ — 0.5r/„-,) case nr 

If we could ever arrive at the point where x„ = y n . t = constant, say 10, 
and if the input were held constant at that value, then this decision rule 
would require constant production equal to sales, because 

y n = 2110- (0.5) (10)] =10 

But under an input disturbance, the combination of amplification and 
Ian can cause an unstable output from which the system cannot recover, 
as shown in Figure 2-12. Note that even after the input sales require- 
ments have dropped to zero, the system continues to call for oscillating 
values of production. 
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n 

z. 
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1 

10* 

10* 

2 

10 

10 

3 

10 

10 

4 

20 

30 
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20 

10 

6 

20 

30 

7 

10 

-10 

8 

10 

30 

9 

10 
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10 

10 

30 

11 

0 
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12 

0 
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13 

0 

-30 


*Za = 0 

*Vo = 0 


2-12 The response of an unstable feedback decision rule as defined In the text. 


Feedback with lag: general considerations 
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compared to the time between possible changes in the input to the system, 
or if the environmental changes affecting the transformation are more rapid 
than the system’s corrective-loop response. 

Usually, only two practical improvement methods arc available. If the 
ratio of (system correction timc/disturbance time ) is not much less than I- 

(1) Restrict possible inputs severely to those that change less frequently, or 

(2) change the feedback system to shorten the correction and transformation 
time. 


PROBLEMS 

2.1 The following gross descripUon of a manufacturing system is often used. 
Let F be the fixed costs per year associated with production and C be the 
variable costs incurred per unit of product. Also, let V be the annual volume 
of production (and sales) in units. Then the total cost incurred per year is 
TC s= F + CV, If the revenue per unit of product made (and sold) is R 
per unit, then the total annual revenue is TR = RV. The profit obtained on 
an annual basis is the difference between TR and TC, if that result is 
positive. A loss will occur if TR — TC is negative. The so-called “break- 
even" point occurs when the volume is sufficient to make TR = TC. With 
these definitions in mind, solve the following problem. 

A firm operates with a fixed cost of $1 million per year, variable costs of 
$10 per unit, and revenue of $30 per unit for a given product. 

A. What is the break-even point? 

B. Automation of the process will double fixed costs, but cut variable costs 
in half. What is the new break-even point? 

C. If the predicted sales are 100,000 units per year, and this comes to pass, 
what is the profit resulting from alternatives A and B above? 

D. Now suppose the sales prediction is not given as a single value but 
rather in terms of a probability distribution, as shown below. What is the 
expected value of the profit from each of the alternatives, A and B? (note: 
For simplicity use the midpoint of the range for each interval in the 
probability table.) 


Safes Forecast 
(in thousands) 

20—40 

4CW0 

60-80 

80-100 

100-120 

120-140 

Probability 

0.1 

0.1 

0.2 

0.3 

0.2 

0.1 


E. Can the break-even computation be considered a transformation? 

F. How does the break-even computation represent system structure? 

G. How is die break-even computation affected by the introduction of un- 
certainty? 
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23 Frequently it is desirable to measure from a distance or to telemeter in- 
formation from the scene of measurement to a safe or convenient observer’s 
location. In this process many conversion steps are frequently used, start- 
ing with the sensor that makes the initial reading and generates the 
message, to the final observer location and reading point. For example, 
suppose a pressure reading is desired from a remote location. The following 
steps might be followed: 


(1) A sensor converts a pressure measurement into a radio frequency. 

(2) That radio frequency, which changes with pressure, is transmitted to 
the reading location. 

(3) At the reading location, the radio-frequency signal is received and con- 
verted into an electric voltage. 

(4) The electric voltage is converted into distance, as indicated by the 
movement of a meter pointer. This meter is calibrated in pressure. 

e o server ooks at the meter, and interprets the result for his 
own use. 


Mtended'frTnq, ° f conversions > or transformations, in telemetry, may be 

extended to other steps as may be necessary. 

are fanSia^h^L? *501??* f* rneasurement conversions with which you 
familiar. Indicate the transformation to be made at each step in English. 

version operation** Spea * y “* * adl b,ock transfer function of the con- 
requential process? 5601111 cbaractcrist ic of each transformation made in this 


to read punched ** 1 * computer P ro Eram designed 
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3 Which cards will be printed if the condition of Switch A is initially 

OFF? If Switch A is initially ON? 

B. By what factor will the number of lines printed divide into the initial 
number of cards read? 

C. Suppose the fines printed were instead duplicated as cards, and that 
these cards were again fed to the computer. By what factor would the 
number of lines printed on the second pass divide into the original number 
of cards? 

D. If N is the number of passes, as described in C above, what is the 
relationship between the original number of cards, n, and the number of 
cards printed (or duplicated) on the Nth pass? (You may assume the 
original number of cards n is arbitrarily large.) 

2.4 The diagram on p. 54 illustrates how counters and loops may be used in 
combination in a logical sequence of operations (A counter is simply a 
computer memory location in which a number is stored.) 

A. Trace the operation of this system from the start. How frequently will 
cards be punched? How frequently will records be printed? How frequently 
will input records (cards) be written on tape? (Express your answers. in 
terms of statements such as "every fifth card," “every fiftieth card,” etc.) 

B. In designing such a system, which loop performs the most work in terms 
of frequency of activity? 

2.5 When a customer order is received, the following steps are performed, and 
the following tests are made. Draw a block diagram of the steps and actions 
taken as a result of the tests. Create a Decision Table that will summarize 
the results of the tests, i e., the actions that will be taken for each combina- 
tion of test results; 

1. The mail is received and the letter is opened. 

2. The order is edited by checking the price and stock number for each 
item ordered as indicated in the catalogue. 

3. The price is extended by multiplying unit price by the number of items 
ordered. 

4. The total value of the order is determined by adding the total amount 
for each item. 

5. When the total value of the order has been determined, the following 
credit checks are made: 

(a) If the value of the order is less than $1000 and the customers pay 
record has been good, the order will be approved for shipment; 

(b) If the value of the order is greater than or equal to $1000, the 
customer’s credit rating is greater than $1000, and his pay record 
has been good, shipment will be made. 

(c) Shipment will always be made if the customer has obtained prior 
executive approval as recorded in the credit file. 
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the total amount due. The accounts receivable will be credited with the 
dollar amount of the items shipped. 

2.6 In the following accounting example, a system of production, sales, and 
finance is described by eleven accounts. 

(1) Cash 

(2) Receivables 

(3) Raw Materials 

(4) Finished Goods 

(5) Fixed Assets 
(0) Production 

(7) General Administrative and Selling Expenses 

(8) Depredation 

(9) Sales 

( 10) Liabilities (Short and Long Term ) 

(11) Net Worth 

Several documents are available from the records of this firm: A Starting 
Balance Sheet, a Balance Sheet for the End of the Period, an Income 
Statement, and a list of operations leading to these results. This material 
may be summarized in the Input-Output Chart of Accounts, which is 
also given. The format of this accounting sheet shows the interaccount 
transfers. The Table shows credits (corresponding to arrows entering an 
account block) and debits (corresponding to arrows leaving account blocks). 
This convention corresponds to the credits and debits in the columns and 
rows of the input-output table, respectively. Total debits must equal total 
credits both for each account and in total, as may be verified from the input- 
output sheet. 

A. Draw a block diagram showing the interrelationship of the accounts of 
this company as indicated by the interaccount transfers for the period. 

B. Indicate the dollar transfers to and from each account. Relate the changes 
in each account level to the starting and ending balance sheets. 

Balance Sheet at the Beginning ^ 

assets: 

Cash 

Receivables 
Fixed 
Inventory 
Finished 
Raw Material 

TOTAL 

LIABILITIES ; 

Short Term 
Long Term 
Net Worth 

TOTAL 


$ 500 
2500 
2000 

1000 (200 units @ §5 each) 

500 (for 200 units) 

$6500 

$3000 

1500 

2000 

'$6500' 
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Balance Sheet at the End of the Period 


assets: 


Cash 

530 

Receivables 

1755 

Fixed 

1800 

Inventory 

Finished 

1000 

Raw Material 

750 

TOTAL 

$5835 

KLIXXES. 

Short Term 

1300 

Long Term 

1350 

Net Worth 

3185 

TOTAL 

S5835 


Income Statement 


SA1XS. 1200 units e $10 each 
COST Of salks: 1200 units @ $5 each 


Administratis e 

Selling 

Depreciation 

Uncollectable 


$3000 

1500 

200 

115 

$4815 


$12000 

6000 

CROSS MARGIN $6000 


$4815 

NTT PROFIT $1185 


Other Operations: 


I- Purchase of Raw Materials- m v 
AdnumsuaUveEspuiu^tz^-^ S, 5 ®' C '' d “- SKI25 > 

e. 53000, “ b 

«. Liabilities: paid o5 $6275 


fnput-Oufput Chart of Accounts 
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A company has four marofeewnns atpMtoente F U« » sates 
(1) Steel, (2) Laminations, (3) Meta! fabrication, (4) Foundry Opera 
lions, and (5) Soles Ut these departments be denoted by Di, Vn Vs, v*. 
S nd Dj In addition to the input of basic taw materials, each department 
consumes some of the output of the other departments, and conversely 
sends some of its own output to other departments. 

Table I shows the intoidepartmcnt transfers in thousands of tons tor a 
given year The sales, or system output, and raw materials used, ot system 
input, are shown for reference. However, we focus our attention on the 
interdepartment transfers, which show the structure of the system. Note 
that the total inputs and outputs are equal in Table 1, both for individual 
departments and for the system as a whole. This simplification eliminates 
consideration q( inventory, ot assumes a constant inventory level from year 
to year. 

We now consider the relative importance of the various inputs to each 
department For notational purposes, the “from" departments are numbered 


f ss l, 2, 3, 4, anil are shown as row m Tabic 1 The “to” departments are 
numbered f = 1, 2, 3, 4 and are shown as columns in Table I. The amount 
shipped from department i to department ; (m thousands of tons pet year) 
is called x 0 These are the entries in Table I. Thus, is 300, etc. The total 
amount shipped (or received) by a department is X, (or X/), the row (or 
column) total Thus, X u the total of row 1 in Table 1, is 1000. Similarly, the 
total of column X, is also 1000 for the method of measurement used. 

Now consider the numbers in Table II. These represent a ratio for each 
column, obtained by dividing the entries m each column of Tabic I by the 
corresponding row total (or column total) . This gives the fraction of each 
departments input that comes from other departments. For example, the 
total input to Department l is 1000 units, 200 of which come from Depart- 
ment 3 Thus, 0 20 or 202 of that total input comes from Department 3, 
etc. We perform a similar computation for each entry in Table I to obtain 
Tabic If, 


As a result of the computation above. Table II consists of the ratios 


Table t Transactions 



0i 

Oa 

03 

04 

Os 

Total 

0, 


300 


200 

500 

1000 

D* 



600 


1200 

2000 

O 3 

2QO 

100 

100 

100 

2500 

3000 

0« 



6C0 


400 

1000 

material 

BOO 

1600 

1500 

700 



Total 

1000 

2000 

5000 

1000 


7000 
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Table II Technological Coefficients 



d, 

d 2 

03 

04 

Dt 


0.15 


0 20 

D t 



0 27 


Oj 

020 

0 05 

003 

0.10 

D 4 



020 


Raw 

material 

080 

080 

050 

0 70 

Check 

totals 

1.00 

1.00 

1 00 

1.00 


a (J = Xy/Xt. Tliese fractions are always less than unity, but greater than or 
equal to zero. They are the technological coefficients that represent the 
structure of inputs to each department of the system. Conversely, it can be 
seen by inspection that, given xy and a,,, Xi may be found easily, because 
X , = Xy/fly. Thus, Xt = xis/flw = 200/0.20 = 1000, etc. 

If we assume linearity and stability of the technological coefficients of 
Table 11, we will realize that the system defined by the interdepartment 
transfers can be used for planning purposes. 

For example, we may now ask how much production in each department 
would be required to obtain a given level of sales from each department. 
If vve designate the gross output from each department (including direct 
sales) as X, and the sales from department i as b„ the input-output balance 
from the system provides a system of four equations in four unknowns. 

X\ — (inXi — ouXi — a lt X, — 014X4 = &i 

X, — QaXi — OttX* — ojjXj — <154X4 ■= 6 j 

A', — ojiXi — aoXt — o»Xj — 054X4 = i>j 

X4 — o*]Xj — 045X2 — OttX> — O44X4 = 64 

In these four equations, the required sales outputs b, would be given, the 
fly values are obtained from Table 11, and the total department outputs X, 
are to be found. 

A. Draw a block diagram of the interdepartmental transfers, and indicate 
on each arrow the amounts transferred between departments. 

B. Add to the diagram the inputs of the primary factors, or raw materials 
going to each department, and the sales going directly from each depart- 
ment to the customers- 

C. Add to this diagram (in a different color) the technological coefficients 
showing the fraction of each department’s input that comes from other de- 
partments- Note that these fractions will add to 1.00 if the fraction due to 
raw material from the outside is considered. 
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D Suppose the sales forecast for the coming year were 

61 = 800, b : = 1000, b> = 3000, and b, = 600. 

Find the required production in each department. Referring to Table I or 
the block diagram, determine the raw material inputs required. 

• E. By reference to Appendix A, note that the system of equations just 
described could also be written in matrix form as 


(l~A)X = b 

where 1 is a 4 x 4 identify matrix, A is a 4 X 4 matrix of the coefficients G,j 
obtained from Table II, X is a four element column vector of the Xi's (to be 
found) , and b is a four element column vector representing the sales fore- 
cast for each department. Solving for X gives the result 
X = (I — A)"*b 

where (I - A) ‘ is the inverse of (/ - A) as described in Appendix A. Using 
this procedure, solve for the values of X and compare the results obtained 
with the answer to D 


F. If many different sales forecasts were to be evaluated for the same 
technological coefficients, why would matrix method he preferable to 
the usual solution methods for the four equations that have four unltnowm? 

l£til7Jj^ i r°'l eia \ CO ' IS ! : '' r,ts Eiv "- 11 wffl be possible to 

££ ^ ”' UK X ">V Riven set of b„ assuming the b, 

naloStTcS'Ss d , T lW " 1 °‘ “V <*»*> i" Uhfe of tech- 
nalogieal coeffiaeuts that svould not pemit such a solution? 

P Unni "« ^emputation, no coostraint was assumed on 

duce whatever was Oriferrf.l^tThTeSw of'oo" 5 ^ ' VC C ° UM ^ 
of the variables X,, i e Dutfin< e “ , of “'“training one or more 

Sd z SJSSLrStaSS 

the teclmoloSl^fe^ my’byYcwf fa',' “l" Y ** “g 1 ” 1 " 8 ' ta 

accident in one department, etc . * technology, a disaster or 

For further tefcrctice .oo p n . , _ 

Llneor Programming e„d f Samuels™, and R. M . Solow, 

especially Chapter 9 ^ Mc 0, ">™1. New Yo.V, 1958, 

w. W. Leontief, The Structure o/rte'SL"’ 0 '' 1 .-'" “ , P u, cutput analysis; 
Umvmity Press, New YorV, 195°, Am * r ‘ c °’' Economy, 2nd ed., Oxford 
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will only take on the values 0, 1, 2, 3, and 4? Compare your alternate 
proposal to the flow diagram shown. 

C. In the original design, or in your new proposal, what additions and 
modifications would be necessary to provide change when the number of 
nickels and dimes in reserve was limited? Under which conditions would 
you prefer to give dimes for change (if available) instead of nickels? 

D. In C, what further additions and modifications in the design would be 
necessary to handle the situation when inappropriate or no change were 
available? 

E. In what way do the added features add to the complexity of the design? 
The flow diagram? The decision table? 


PROJECT 2 


An elevator with special timing features is to work automatically according to the 
rules outlined here. 

A. In Motion 


1. Going up 

(a) Will stop and open doors at any floor for which the up button has been 
pressed by a person who is either in the elevator or on the given floor. 

(b) Will stop at the top floor automatically, open doors, and reverse direc- 
tion of operation from up to down in preparation for next descent. 

2. Coingdown 

(a) Will stop at any floor for which the “down" button has been pressed 
either in the elevator or on the given floor, and open doors, 

(b) Will stop at the bottom floor automatically, open doors, and reverse 
direction from down to up in preparation for next ascent. 

3. In either direction 

(a) When stopping at a given floor, the register recording the stop signal 
for that floor is cleared 

(b) The elevator will test for the arrival at a floor stop continuously as it 
moves, and having reached the floor signal location will test for the 
specified stop conditions in sufficient time to stop if required. The 
detailed mechanism of this process may be omitted. Assume that the 
elevator may test for a “stop” or “no stop” at any floor by proper 
indexing of a test register as described hereafter. 


B. Halted 

1. At bottom floor doors will remain open until one of the following three 
conditions occurs. 

(a) A period of ((1) seconds has passed, the time ((1) to be set differently 
for different times of day (and there is at least one call for service up 
or down, otherwise after a wait of ((10) seconds, the elevator cheeks 
for calls, moves when there is at least one, but remains at the bottom 
floor until that time). 





(b) The weight of passengers on the elevator reaches 1000 pounds or more. 

(c) The dispatcher manually initiates a start Cram the lobby. 

2 At the top floor two conditions for a start prevail 

(a) A period of t{2) seconds has passed, the time to be set differently for 
different hours 

(b ) The weight of passengers reaches 1000 pounds or more. 

3 When going up 

(a) Will remain halted for 1(3) seconds with doors open, e(3) also ad* 
instable for different hours, or until the “close door” button is pressed, 
whichever happens first 

(b) WtU remain open while the “open door" button is pressed and will not 
start until tbis button is released regardless of previous condition (a). 

4 When going down 

(a) Wilijremain halted tor a tune <(4) with doors open or until the “close 
door" button is pressed, whichever happens first. 

(b) Will remain open while the “open door" button is pressed and will not 
start until this button is released regardless of previous condition (a). 

5 At any door closing time 

(a) Motion will n D t start until test of door closure is made; should door be 
open, t e test is repeated after a delay of one second, and continued 
“ is “ aW * !,er *"*“ cw - 

<b) ,atey ,M,U " w " als ° * «• <»*“ 

C. Register updating 
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R(ij) Upi _ 1 
Down 1 = 1 


Floor / 


(b) Variable timing feature. Let t(k) be the time values specified in the 
description above, where the coding means 


<0> 

U2) 


1(4) 

Bottom 

Top 

Up 

Down 


These data may be adjusted according to an hourly program (not specified 
here), and may be assumed fixed and given, as well as available for test 
at any time. 

(c) As a result of the notation for HO’,/), the following simplified notation 
may be used in analysis: 


fltt/) = 1 


RO.i)=o 
/ = / + ! 

/ = /-! 
i=l 
i ~ 2 
/= 1 
i = 4 
t 

t - 0 
t = tm 


the elector should stop at floor f 

when in state < 

the elevator will not stop 

go to the next floor above for next test 

go to the next floor below for next test 

the elevator is going up 

the elevator is going down 

the elevator is at the bottom floor 

the elevator is at the top floor 

the time value of a clock in seconds 

the clock has been reset to (is) 0 

(After reset, clock increments in seconds) 

has the value of the clock reached the test 

value t(k)? 


With this information: 

A. Draw a logical block diagram for the tests and operations required to control 
the elevator in the required manner. Use the suggested notation. You wall need 
a stop test, clear, and reset routing; a timing test, clear, and reset routine; and 
also a register data entry routine. 

B. Can you draw your logical block diagram in any alternate ways, i e., different 
logical sequences, and produce the same results? Can you simplify your logical 
diagram? 

C. Can you suggest other representations to portray the logical sequence of 
events required in this control scheme? 

D. If two elevators were to be coordinated so that one was generally going up 
while the other was generally going down, what additional modifications would 
be required in your diagram. ( Specify, but do not make these changes in your 
diagram.) 
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DETAILED METHODS 
OF SYSTEMS DESCRIPTION: 
THE FLOW GRAPH 
AND OTHER METHODS 


The block diagrams of Chapter 2 represented operations by blocks and 
variables that connect operations by arrows. An alternative procedure is 
the flow graph approach, which is useful in describing many systems, 
particularly those that have “linear" operations, or those that involve de- 
tailed probability analyses. 1 


FLOW GRAPHS: A GENERAL DESCRIPTION 

TheB ^-ypl, is Ike duel of the block diagram, i,e„ the same elements end 
Wrft ' r’T"’ in PM* the variables in the system 

a " d U ‘ e »r transfer function 

iZ nance . ta 8o "' ana >ysis) then becomes an 

applications. Tbe S So.v g™^' “i!'’ 11 ’' Sl “ pUfiB *he network in many 
analysis is reouired ( vv,» r St fPP r0 P nate ' v ‘hen detailed algebraic 

^ md to Z^erlnb : ^ ^ *0 bl4k dia- 

most convenient to JSj W * s, ' sll!ms de& Won, then use the form 

VWWr. f p pb see A. D. Hall, A 

mg, van .N cut rand, Pnnceton, N.J„ 1962. Chapter 14. 
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Flow Graph Construction 

To construct a flow’ graph, we first fist each variable in the system and draw 
a small circle (or node) for each one listed. Each such node is labeled 
with its variable name, e.g., x, y, or z. 

Next, we place the relations between variables (the transfer functions 
of Chapter 2, which will hereafter be called transmittances to conform to 
usual flow graph terminology) on the arrows connecting each “variable" 
node. The direction of the arrow shows the direction in which the trans- 
mittance is applied, and so provides the input-output sequence. 

Finally, we adopt three conventions: (1) All arrows leaving a node 
carry the variable at the node. (2) The value of the variable at a node 
is the sum of all the transformed variables entering the node as “signals" 
from incoming arrows. (3) The transformed variable at the point of an 
arrow head is equal to the variable at that arrow tail multiplied by the 
arrow' transmittance. 

Example: Suppose we have a pair of equations, y, = a,x and y t = bix, 
which relate two output variables to one input variable. Then, we 
obtain the equivalent flow graph shown in Figure 3-la. Similarly, suppose 



(a) 



*o 


* 1*2 + t>lt>2 

(C) 


O' 


3-1 A flow graph- 
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we have another algebraic equation, z = a 2 y t + b,tj u which relates two 
input variables to one output variable. The equivalent flow graph for 
this equation is shown in Figure 3-lb. Suppose now that we want to 
combine the three equations to show the relationship between z and x, 
talcing these variables as the system output and input variables, re- 
spectively. Application of the above conventions will reduce the graphs 
of 3-la and b to the single graph of Figure 3-lc, so that we have the 
equivalent transmittance (or transfer function) of the system a s zfx = 
a, a. — b,b_. (In combining the flow graphs, the intervening variables 1/1 
and y . are eliminated.) This result is the same as the result we would 
have obtained if we used block diagram analysis. 


Flow Graph Reduction 

The rales for flon graph reduction (as in the example above) follow the 
roles for block diagram reduction and arc summarized in Figure 3-2. 

Multiplication law 

The transmittanccs of two arrows in scries multiply (Figure 3-2a). 
Addition law 

..onTa'ddTrCeTob)"'™ am, " S Para,lc ' (g0i " g ta *> 

Expansion taw 

may be replaced r bv°transmnt folIowin S l he multiplication law, 

The feedback loop 

theu«k^g ra “ m ^^ r X* e ^”mT sm ‘ ,Un “ as for 

■O ..self fa ever dimtalsMog amouo^ w M * Va * ,M ' ~ hi baA ' 
geometnc series (1 + a + a 2 + a > +’ v*," 11 * ^ ~ a ) 1S 511115 a 
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O — 5 — *-o — b —o - 0—‘ b - Q 

fa) 



3-2 Rules lor flow graph reduction. 


Examples of flow graph application can be found in many fields. For 
example. Figure 3-3 shows a simple flow graph used to describe an economic 
principle. To ajjara his understanding of the method, the reader should 
devise such examples on his own. 


The Problem of Lag and Time Delay 

As was the case for the block diagram, we have presented the flow graph 
approach for algebraic transfer functions (or transmittances) that do not 
involve time delay; i.e., we have assumed instantaneous transformation of 
variables throughout the system. Although this assumption does not gen- 
erally hold, the block diagram and flow graph approaches have general 
applications, because the methods of Appendix B can again be used to 
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lc) 


3-3 A flow graph showing Lord Keynes' theory of capital movement to Illustrate 
flow graph reduction, (a) The theoretical relation between Investment I, Income Y, 
Consumption C. Production P. and Savings S. A certain fraction of Income, * In the 
flow graph, is devoted to consumption, so that C = kY. The other part of Income 
(1 — k) is devoted to Savings, as shown. If attention Is direction to the relation bo* 
tween Income end Investment only, the flow graph of (b) results, by the reduction 
rule of Figure 3-2a, For this purpose. Savings S may be omitted from the graph. By 
using the rule of Figure 3-2d, we can reduce (b) to the equivalent (c). This graph Is 
equivalent to the equation Y = I/ll — k). In which k Is the "propensity to consume," 
and t/lt — k) Is the well-known Keynes* multiplier. 


reduce. the time delay case to a form more suitable lor algebraic treftfeneot 
as described above. 


Equivalent Matrix ol a Flow Graph 

The equivalent matrix of a block diagram (see Figure 1-8) showed the 
system’s black boxes as rows and columns, and the variables connecting 
them as table entries. 

The procedure followed to develop the equivalent matrix of a flow 
graph is the same as that followed for a block diagram: The network nodes 
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are the rows and columns, and the arrows arc the table entries. However, 
the definitions of network nodes and arrows are interchanged when we 
use a flow graph. Therefore, as Figure 3-4 shows, the equivalent matrix 
of a flow graph fists the variables as rows and columns and the connecting 
transmittances as table entries. 


FLOW GRAPHS— STATE-DESCRIPTIVE SYSTEMS 

The term state refers to a particular and “discrete* system condition. 

Many systems can be described by discrete, rather than continuous, 
variables. Such cases arise naturally when the results of a decision or 
transformation can only be integers or whole numbers, or when the presence 
or absence of an attribute (e.g., red, yellow, green) is used to describe 
an outcome. 

The outcome of a system’s operation may be “use Plan A " or “use 
Plan B," which is a discrete result. We have one or two customers waiting, 
but not 1.5. A customer uses or does not use our product. An employee is 
in one tax bracket or another. A switch is “on" or "off." Indeed, any 
"continuous” system can always be described by an equivalent discrete 
system, if we agree to classify outcomes into a limited number of categories. 
Statistical measurements, for example, are often classified, or grouped, into 



3-4 The equivalent matrix of a flow graph, (a) A flow graph with variables num- 
bered 1 through 6 with the transmittance connecting the variables, (b) The equivalent 
matrix presentation. The system variables become the rows and columns of the 
matrix; the flow graph transmittances become the matrix entries. 
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10 or 20 categories, rather than handled on a continuous basis, because 
the resulting calculations are easier and result in little loss of precision.* 

Slate-Descriptive Systems and Transitions 

For the state-desenptive system, we usually want to know how the system 
makes transitions from one state to another, i.e„ how one condition makes a 
transition to another. 


The coin changing logical device of Chapter 2, Figure 2-7, was a 
discrete logical system in which the transition of states was conditional, 
«.e, the transition was dependent upon a set of specifications and input 
conditions (the coins received). In such conditional cases the block diagram 
description of the system, as in Figure 2-6, is often easier to handle than 
the How graph Howescr, there is a class of important systems for which 
system operation depends only upon the previous state of the sy stem, or is 
' l0na u P° n a e " simple factors. In such cases the flow graph approach 

TU ° eXamp,es " i11 ilIustra ‘e the distinction and 
show the meaning of state transition. 
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population tte will mSTw iW «*" fraction of 

assume in Figure 3-6 that ih t pomt ,n bme to the next. (We 
probabilities, remain constant Tactlons move ment, or transition 

probabilities arTsSi^.) ^ ^ * 1116 next - **- * e 

The examples illustrate the state-rW,,- .• 
graph is of major use: (1) The • • pllve cases in which the flow 
( 1 1 The deterministic state-descriptive system; and 

fractions Of cancm, TL , n ? rest 50 gallons J ’ g ’ instead of counting 

i* iLS'civ™ 
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3-5 A simple deterministic state-descriptive device, (a) Shows a shaft which may 
bo Indexed to one of four positions. A, B, C , or D. The progression of movement i3 
fixed In sequence by the physical construction of the indent and the clockwise move- 
ment of the shaft. The equivalent flow graph and transition matrix of this device are 
shown In (b). 



To state 


From 

state 



3-6 A simple probabilistic state-descriptive system. A person in state 1 Is a customer, 
one in state 2 Is a non-customer. The flow graph, and the equivalent matrix, show the 
probability that a customer will move from one state to another at the next observa- 
tion, or remain where he was. In marketing applications the transition matrix Is often 
called a “switching'' matrix. 

(2) the probabilistic state-descriptive system with constant probabilities 
of transition. 4 


The state-descriptive flow graph 

The examples also show how the flow graph rules are modified to show 
state-descriptive systems. The nodes of the flow graph become the states 

* Generalization is possible, but a comprehensive treatment is beyond the scope of this 
discussion. See A. T. Bharauha-Reid, Elements of the Theory of Markov Processes and 
Their Applications , McGraw-Hill, New York, 1880. 
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through which the system can pass, and the arrows show the transitions, 
either deterministic or probabilistic. For the deterministic system, all traps- 
nuttances are constant and equal unity. For the probabilistic system, the 
transmittances become the probability of transition. 


Matrix analysis of state-descriptive flow graphs 

Suppose the state vector (p„ p 2 ) is used to describe the probability 
that the system is in state \ or state 2 at a given time, where the sum of 
probabilities in the vector must be 10. Then, given (jh, pi) for one time 
or stage, the probabilities for the next time instant can be obtained by 
multiplying (p )f p, ) by the transition matrix lor the system. 

Example: Refer to Figure 3-6, which shows the transition matrix for 
the customer switching problem. Note the probability that a customer 
will be in state 1 at the next instant is composed of two possibilities. 

1. He was in state 1 at the last instant and 
He remained in state I, or 

2. He was in state 2 at the last instant and 

He made the transition from state 2 to state 1. 

Symbolically, 


and, similarly. 


pi * pip» + Pipn 


Pi - PiPa 4- p,pa 

which is precisely the same as the matrix notation 

w, po H 

V_P» p»J 

or, more compactly, 


Pi “ p.T 

P ‘ re I )te,m( Uhci and former slate vectors (proba- 
2L* 5 ^ ubm y“P«=>i«ly. and r is the transition matrix. To 
p0[, “ k!i ‘ m “ *P>i< 50/50 between states 1 and 2 
matrix of Frnim. w" °' 50 '- T ' len usin S *0 transition 

Bad It/ n' 1- ^ ° Vvin - 1110 above procedure, we would 

Wsnvefv ’ l ( 9 ' °f } L I™** can 1* *!*»<«» 

subseouetit sen-fT 3 ? of state probabilities for each 

° 0 °hservation. Such a trajectory will occur because 
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the system has a “memory" of one time period: Its next transition 
depends on the former state of the system.® 

The deterministic example (e.g., Figure 3-5) is a special case of the 
general transition problem with transition probabilities set equal to one. 
The same method of matrix multiplication can be used to trace the state 
transition of the deterministic system. The method is general for any 
number of states. 

Steady-state or equilibrium conditions 

Reluming to the example of Figure 3-8, we would discover, upon 
repeated computation of the state probabilities, that after a time the state 
probabilities do not change from one stage to the next. Systems of the 
type illustrated always reach such an equilibrium, which is independent 
of the initial or starting condition. We can find these final probabilities by 
noting that the equilibrium condition requires 

Pi = piPn + pipn 
Pi = PiPu + pipn 

By noting that pi + p t = 1, and using one of the above equations, we can 
find a solution. Thus, for the transition matrix of Figure 3-6, using the 
first equation, we have 

pi = 0.95pi + 0.03pj or p, = f pi 
and 

Pi + pi = pi + Ipi = 1 
Pi = f and pi = 1 

In the long run, given the transition probabilities shown in Figure 3-6, the 
market share will be 4. The method is general for any number of states. 


Multidimension state diagrams 

In some systems it may be desirable or necessary to define a state by 
two or more attributes. When this is so, a multidimensional state diagram 
can be useful in developing the transition matrix- 

* The Illustration given is a simple Markov chain, named after the Russian A. A. Markov 
(1856-1922) who studied such processes extensively. See W. Feller, Introduction 
to Probability and Its Applications, Wiley, New York, 1957. A series of interesting 
examples appears in J. G. Kemeny et al , Finite Mathematics utih Business Applica- 
tions, Prentice-Hall, Englewood Cliffs, N.J., 1962. For many interesting examples of 
the deterministic case, see W. R. Ashby, Introduction to Cybernetics, Wiley, New 
York, 1963 
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Example: A research team is assigned two types of projects : A — applied 
and B = basic, as determined by the supervisor. The team may be 
assigned any combination of project types at a given time (including 
none), but at no time may the total number of projects be greater than 
three. We can define i as the number of projects of type A and / as the 
number of projects of type B assigned to the team. Thus, a given state 
is distinguished by tbe index ( i, ;). The number of such states is easily 
seen from the two-dimensional state diagram of Figure 3-7. The diagonal 
line (i + j = 3) limits the set to the fen states shown. (The state proba- 
bility vector will have ten positions here, and the corresponding transi- 
tion matrix will be 10 X 10.) The concept of tbe two-dimensional 
diagram can be extended to more dimensions. However, once tbe con- 
cept is fixed, vector and matrix notation is usually more desirable. 

Since one of the most common errors in probability analysis is the 
erroneous omission of a possible outcome, the state diagram is a useful 
tool to insure completeness of description. 

Tree Diagrams and Sequences ot Transition 

A flow graph that shows the possible outcomes at each stage or step of a 
probabilistic or logical transition process as states has a branching appear- 


I ~ Number of 
basic projects 



appneg projects 

may ret ™mt>er of projecu (of either 

bfack below end on the dotted line (/ -f f ^^mations are limited to those shown In 
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a nee. The number of possible outcomes or the paths leading to given out- 
comes increases with the number of stages, hence, the name tree diagram. 
Such diagrams are useful in the description and analyses of combinational 
problems, where a sequence of events must be traced. 

Example: Consider the simple customer problem of Figure 3-6. Suppose 
we want to trace the possible transitions that a customer { state 1) 
can follow. Figure 3-8 shows the possible branching that can result. 

In this example the final states (at stage 3) are states 1 and 2, but the 
possible routes leading to each are shown. The probability of being in 
state 1 or 2 can be obtained by multiplying the probabilities on the 
scries of legs leading to a given outcome, and finally, by summarizing 



IRA transition tree. This tree shows the possible routes a noncustomer may follow 
in thrnp transitions. The probabilities of bemg in state (1) or (2) may be found at any 
ct b hv multi plication down the tree and an accumulation of the probabilities for 
Sen s^ate a the ««* «»• This is equivalent to the matrix multiplication of the 
probability of being in state (1) or (2) at the start by the trans.t.on matrix three times. 
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for each outcome state. The probabilities of being in state 1 or state 2 
at each stage are summarized below, rounded to two figures. 


ST ACE 

STATE 1 

STATE 2 

0 

1.00 

0.00 

1 

0.95 

0.05 

2 

0.90 

0.10 

3 

0.80 

0.14 


The sum of the probabilities for all states at a given stage must be 1.0. 
(The reader should compare the tree approach to the matrix approach 
illustrated on page 72.) The tree diagram need not be symmetric 
as in Figure 3-8, because the process of transition may terminate accord- 
mg to rules stated beforehand. In such problems, the decision tree is 
useful in defining the terminal conditions ot the transition process. 

E J I n1 p!e i, ,0n ! ! a ° 4 Smith ffi P a ““ under *6 following rules. A coin 
" , ft PP ed . ®ud Ae game will continue until a “head” appears or 

mud four flips have been made, whichever conies first. Jones will receive 
Smith ° C ? lrS (and ftl: S 1°PS). but will P a y 

w ' ' h aPPeMS a " d ,he Same “uWnucs. or terminates at the 
nrobabibhi "7 ma ^r outc °mes does this game have? What are the 
value o A W rm ay ,° S r f0r “* p0 “' We outa,, u e? 'Vhat is the expected 
of times?* S ,0nKl a!sumi "S "Us game is repeated n number 

Figure 3-9 shows the Sow graph tree of the game. The probability 
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of each transition is shown on each arrow, and the probability of each 
terminating outcome can be found by successively multiplying the 
probabilities on each outcome path. The amounts won (or paid) by 
Jones can be found by successively adding his profit or loss along each 
outcome path. Summarizing the results for all outcomes, we have 


OUTCOME PROBABILITY PAYOFF 


1 

0.5000 

1 

2 

0.2500 

0 

3 

0.1250 

-1 

•1 

0.0625 

-2 

5 

0.0625 

-4 


1.0000 



WORTH 


0 500 
0.000 
-0.125 
-0.125 
-0.250 

0 000 (expected value) 


The sum of all terminating outcome probabilities must be 1.0, and the 
expected value is the sum of the worth of each outcome (outcome 
probability times outcome payoff). Because the expected value of the 
game is zero, it is a "fair game." The correct construction of the decision 
tree permitted easy isolation of all the outcomes, and assured correct 
computation of the payoffs and outcome probabilities. 

Tree diagrams also have other uses, such as the generation of 
combinations of outcomes. 

Example: We plan to open a restaurant but must decide on the type of 
cuisine (Chinese, French, or American), the location (uptown, mid- 
town, or downtown), and the seating capacity (large, medium, or 
small). This gives us3X3X3 = 27 possible combinations (e.g., large, 
downtown, American restaurant and 26 other such combinations), as 
the construction of a tree diagram would clearly show. In complex 
combination or classification problems the use of the tree diagram can 
prevent erroneous omission of possibilities. 

The stages in the transition or logical process described by a tree 
may relate to sequences of decision in time, or sequences of actions and 
consequences. In such cases, a decision causes the branching in all or 
part of the tree. Thus, the term decision tree often describes this form 
of graph. 

Example: A decision tree, as shown in Figure 3-10, may be composed 
of both actions (decisions) and outcomes that may be due to chance. 
For example, once research is started, the outcome may be satisfactory 
or unsatisfactory, with an uncertain outcome. However, if the research 
is “satisfactory" according to stated criteria, the decision to produce 
now or later is not a matter of chance, but is controlled by the decision- 
maker. Estimates of the probability of each terminating outcome can be 
obtained from a diagram such as Figure 3-10. 
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O — Decision point 
O = Event, continuing 
• = Event, terminal 


A tree composed ot decisions and probability trials. 


Network Flow Problems 

The flow graph can be generated to describe special flow and capacity 
rcstnctions in networks. In many of these cases the nodes of the Bow graph 
OncaT P 0 '" 15 hme or space. For example, the fact that rve are in 
Scf e 8 'n,eV h th,S r.‘ ! Sunda> '' ™ rc P rcse "' a state, and be shown by a 
states For °m anow * ciln *' R ' n show the ways one can reach given 
(instead of T V. nTJ f ems ' he arr0 '« ol the flow graph could show 
point and anoth F '’’ ™ gth liTC ’ ” ^ ^"ce between one 
Zputalr™t, a , Tc We oI sy5,em Problem, thereiore, special 
SK' U5Cd SeVCTal S ™P ,iSa ' ) — Pl- HI illustrate the 

ES? ,0 Bnd IonsKt <? 

by each arrow What areth 10 ?^. 6 "^ 11 that fhe distances are shown 
% inspection *" " D? 

ABCD = 25 
ACD =22 
AD = 20 

22?1 PMh ABCDfa ,h = h »* °" d P* AD is the shortest, 
a networV of maximum ffo^ of fl0W g ' aph ,e P res ents 
time. We may ask for the maximum duct ***** ma V at a given 
opacity of the network to handle 



30 Detailed Methods of Systems Description 


For illustration, assume that the distance AB ~ BA by the method of 
transportation used, so we can travel in either direction for the same 
cost (as shown on the arrows) If we start at, say, A, and must visit 
each of the remaining cities once and only once, we may ask for the 
cheapest route. We have a minimum sum problem (although generally, 
we could ask for any function of the values shown). For this case the 
problem may be solved by enumeration to obtain the minimum value 
of 55, This is a simple version of the so-called traveling salesman 
problem. 


PROBLEMS 

3 1 Using the rales to, Dow graph smp li EcaKm described i„ this chapter, fed 
“'C 1 "" transmittance between the smallest numbered and the highest 
numbered variable m each of the four flow graphs given on p. 81, 
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SnTnX <” «>« &st equation. Using Ore 

the second equation l ^ e arrows { ° T ‘he trarumittances indicated by 
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orv t at ihc ratio of »s/Xr IS (by reducing the flow graph) 
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„ No)e ** ^-lt+e)+he 

Compare the flow KraDh^Wr^° Ve ‘ I* 16 , varia ble a, has been eliminated, 
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conditions would the fl,™ „„ i i>om another. Under sshal 

“°' v 8ta l> h "PProach be most useful? 

3-3 The diagram on p go 

separated into activities (amwsl 3 10 ^ rcsearc h and development project 
been niunbeied so £ *?* <®der>. The diagri, has 

number than ,t, ‘enmnates at an cent of higher 

aclmty is go t . ri a |„ ) , c (iic L Jme required for completion of each 
Protect description f s „ scd , . Out activity. (This form of 

S a t“' fcm “ 01 Untical PathsJST- tcc,,ni q n ° °f planning and in 
■»e»t of oomples Pa ' h ^edulmg „**, widely in the manage- 
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«) 

Problem 3-1 




... , . . j u have been completed), what is the longest 

until both activities 2,4 and 1,4 ha event 7. the 

, .. mute which determined this maximum sum. 

B. Mark on the diagram the route which 

This is called the critical path. r 

. , completion tune and the critical path u 

C. What happens to ““/ uced 7 toe periods to 1? 

the time required for activity 4, . {f 

i iso** rr>mnletion time and the cnhcai path it 
O. What happens to the “ rt f‘ ^ ed t0 0 ffine periods? 
the toe required for acnvaty 4,5 is tedocea to 
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Problem 3-7 


Assume that both S and \V make independent moves according to their 
transition matrices at each new instant of time and that by this process, a 
"steady state" situation has been reached. What fraction of the time is the 
total system composed of S and \V working? (Hint: find the steady-state 
probabilities of S and W, then use the multiplication and addition laws of 
probability to compute the probability that the machine will be working. 
For illustration, a diagram of one system meeting the description is given. 
In this diagram, a light bulb lights when one of the specified conditions is 
specified.) 


Matrix for S 
-*S t St S t 


Matrix for IF 
-»TFi W t IF, 


*r» * >1 >r.r i » n 

a i i ! in i j i 

4» i iJ jp.L i , jj 

A Possible SAV System 

3.8 The output of a system y[t), a function of time, is given by the following 
eqoahon, In which a and b w constants. The input to ihe system is x(i) 

vM~anCI)-i— 

di> 

l/b — u mw? 5 transfonn approach of Appendix B and the substitution 
/b ~ the function of the system, Y (,)/*(,). 

d M rS.to™^ l r h ep,S?" <,) “ * “«"*'■ “ *<*> = >• « 

,l XM = l/ *- Nrte 


A Bb 
* e’ + tr** 
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D. Find y (I) if x(l) is a ramp, i.c., if X(i) = 1/s 1 , Note that the appro- 
priate partial fraction expansion is 


>• + «■ + tr,' 

E. Find y(t) if x(t) is A sin wt, where A is a constant and w ^ w 0 . Note 
that the appropriate partial fraction expansion is 


a* + w#* a* + u>* 

F. In E what is the result if w approaches w 0 ? If f is measured in minutes 
and w = 2*/, where / is the frequency in c>des per minute of the input 
variation x(t), what is the critical frequency for this system? 

(In the above problem, the required transform pair is 
Function Laplace Transform 


&in irt 


to 

a* + ta* 


where to is a constant representing the angular frequency in radians, or 2 *■/.) 


*3.9 The output of a system, y(n), is given as a function of its input x(n), and 
the output at previous tomes, y(n — 1) and y(n — 2). 

y(n) = 0.5x(n) + y(n — 1) — 0.25y(« — 2) 

A. Assuming that the time penods n are equally spaced, find the transfer 
function of the system in a- transform terms by using either algebraic or flow 
graph reduction methods. (Refer to Appendix B.) 

B. What is the output response of the s>stem if the input is a unit impulse, 
x(n) ss 1, n = 0; x = 0 otherwise? 

C. What is the output of the sjstem if the input is a unit step, x(n) = 1, 
nS=0? 

Note that the partial fraction expansion required is 

(1 - z) (1 - 0.5a)* (1 - 0.5a) 

D. Compare the results obtained in (B) and (C) with the results you 
Would obtain by direct numerical computation of y(n) for a few values. If 
the results of the numerical computation and the algebraic values of y(t) 
do not agree, try again. They should. 
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AND HIERARCHIES 
4 IN SYSTEM DEFINITION 
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permits the investigator to simplify or expand his system definition. Thus, 
when defining a system, we say that an element or transaction belongs 
to a given class, or may itself own or encompass other elements or trans- 
actions. Similarly, we say that several systems belong to the same class, or 
own other subsystems. 

Although classification methods are familiar to anyone who reads or 
speaks (because language itself is a classification form), we will devote 
this and the following chapter to some of the special classification tech- 
niques of immediate interest to us as systems analysts. 


DATA AND FILE ORGANIZATION 

If data are formed into records, and records are formed into files, some 
ordering method is necessary so that we can easily obtain information 
when we need it. File organization has particular importance when data 
must be collected for system definition, or for the design of information 
systems with storage points for data in the information flow sequence. 

File organization using codes is discussed in Appendix C. A few 
examples of general file structure will be useful here. 

Example: Suppose we have an information file consisting of the names 
of employees in each department of a manufacturing firm. We may 
then inquire about the structure of this list. Conversely, we may have 
the names of a group of employees and wish to structure the list in a 
form representing their function, ability, or other classification group. 

We could use the following outline: 

I. Department 

A. Name 

1. Last 

2. First 

3. Initials 

B. Address 

1. City 

2. Street 

3. Street number 

C. Job 

1. Job name 

2. Work center assignment 

3. Rate of pay 

4. Seniority 

5. Skill level 

6. Educational level 
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D. Non job Skills 

1. Manual 

2. Hobby 

3. Language 

4. Education 

5. Other 

E. Nonjob Memberships 
L Fraternal organizations 

2. Professional or worker organizations 

3. Community organizations 

4. Other organizations 

Although we recognize that this “outline" is not the only one we 
could have devised, it does describe the actual way the files for cm- 
ployees could he kept in a given organization or. specifically, the way a 
given personnel form has been structured. 

Note that the organization of the file, like the system using it, is 
determined by (a) the distinct elements in the file; and (b) the re- 
lationship or ordering of these elements. Also note that we could physi- 
cally arrange the data on a particular clement without regard to 
hierarchy, but if we wish to extract information from a collection of 
records, or organize the elements in the file, we must impose a hierarchy. 

For example, suppose we arrange the file described in the above 
outline in tabular form with the data in columns. The row of information 
concerning a given employee (which is a vector describing the indi- 
viduals attributes) has the physical ordering of the table columns, 
but no necessary hierarchy of items is implied in the description. How- 
ever, when we ask for an ordering of individuals by given criteria, say, 
for example, by last name, first name, and initial, then we imply a 
hierarchy of classification (vertically) in which the last name is more 
important than the first. Similarly, we may wish to extract from the file 
all persons who are machinists with more than five-year seniority and 
who belong to the Benevolent Protective Order of the Elks. The 
hierarchy Job Name/Seniority/Fraternal Organization is implied then 
in the sort required for extracting the data (Job Name would therefore 
be the major “key"). 

Similarly, records may be ordered in an alphabetical hierarchy, or 
in a hierarchy according to time, or in another combination of major 
and minor categories, as chosen by the analyst. 


Arbitrary or Artificial Hierarchies 


Thus, the clement data in a file may be arranged in many ways, and t 
particular quesUon or senes of questions posed by the investigator sets t 
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hierarchy or levels of classification. In one sense, such a series of questions 
may be thought of as an arbitrary hierarchy or classification that results 
from a particular application. The resulting hierarchy is seldom exhaus- 
tively complete. 

Certain random access files are good illustrations. For example, the 
McBce Keysort System stores data on edge-punched cards. In the McBee 
Keysort System, elements with specific characteristics can be drawn from 
the file by the insertion of a needle, which retains the unwanted cards and 
releases the wanted cards, edge-punched for the appropriate key. (For 
further detail, see Chapter 10, Figure 10-5.) 

In such files, the physical position of a wanted card in the deck of 
possibilities need not be known, and a wide variety of transaction sequences, 
or hierarchy arrangement can be produced as needed. The file hierarchy 
is random, until some external criteria arc imposed. 

Natural Hierarchies 

We can contrast the random file and its artificial manipulations with a 
classification scheme that is designed to be complete and exhaustive and 
that explains phenomena or structures observations in nature. The familiar 
chemical and biological classification schemes are examples. 

The classical example is Mendeleev’s classification of the chemical elements 
based on their arrangement into vertical columns and horizontal rows accord- 
ing to their atomic weight. According to Mendeleev’s own account this idea 
came to him in 1868 while he was writing a textbook of chemistry. He was 
looking for an easy classification for teaching, rather than for serious scientific 
purposes. . . . Having entered the atomic weight and other characteristics of 
every element on separate cards, he was struck by the fact that the properties 
of the elements were related to their atomic weights. From certain gaps in this 
Periodic Table, chemists were quick to conclude that the empty spaces repre- 
sented undiscovered elements, an assumption that was amply rewarded by 
their experimental discovery of the new elements Gallium, Germanium, and 
Scandium, which were found to fit neatly into the gaps.* 

Such natural hierarchies are, in science, presumably related to actual 
relationships and orderings observed in nature. The more nearly a hierarchy 
conforms to the ideals of completeness, consistency, and realism, the more 
“natural” it is. In this sense, classification may be thought of as a basic 
tool of science, which studies the properties of classes of phenomena, rather 
than of particular instances. Thus, it is assumed in science that, once the 
results of a particular experiment have been observed, these results may be 
imputed to the class of experiments to which the one belongs. The ability 
to transfer results correctly depends upon the accuracy of classification. 

*R. Lewinsohn, Science, Prophecy, and Prediction. Fawcett Publishing, Greenwich, 
Conn., 1962, p. 35. 
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Soma Natural System Hierarchies 

Just as a sjstem may be defined differently for different system objectives, 
so may the hierarchies describing system structure. Some familiar natural 
hierarchies are the formal and informal organization chart, the bookkeeping 
chart of accounts, and the functional hierarchies of equipment and work 
groups. Ancestral hierarchies, or family trees, are a structural hierarchy 
based on heredity, and historical trees, or hierarchies of events, are also 
in common use, as shown in Chapter 3. 

Note that a given physical system may involve several hierarchies at the 
same time. Thus, a given firm may have a formal organization chart that 
differs from its informal organization, both of which may differ from the 
functional hierarchy of work assignments. 

Systems also exhibit hierarchies of objectives, the purpose of which, 
for example, may be to integrate the operations of several subdepartments 
to superior departments and the company, or organizational, purpose. 
Similarly, a hierarchy of plans for each department, classified also into 
immediate, short-range, and long-range categories, may be considered for 
a given system. 

Because this wide variety of hierarchies can define a system at any 
given time, we should specify the classification mode to be used in a given 
system definition. 

In the same way, we can classify a system component or transaction 
(which we will hereafter call an entity to denote an item to be classified) 
in several different categories simultaneously if vve use different hierarchies 
to describe a given system. Thus, an employee may belong to one category 
in the formal hierarchy of an organization, to another category in the 
informal hierarchy, and to yet another category in the functional work- 
assignment hierarchy. Such multiple classification of the employee is con- 
sistent provided vve understand to which hierarchy a given classification 
category belongs. 


Hierarchy Change* 
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MECHANIZING SYSTEM DEFINITION 

System definition often requires massive data-processing efforts. This is so, 
for example, when an industrial information system must be studied, when 
many possible transactions occur over a period of time, or when a system 
contains many alternate transformations. Moreover, a number of individuals 
may be required to define a detailed system in a reasonable time, and, if 
it is to be investigated efficiently, the system must often be defined in a 
way that can be transferred to data-processing equipment. 

The usual method of organizing such definition efforts is to employ 
uniform data collection and data-processing methods having rigid format 
specifications. In practice, this is usually accomplished by developing a 
manual of standard terms and definitions, and, thereafter, by using codes 
(vectors) or files of codes (matrices) to maintain a consistent, complete 
description method. The approach is familiar to the field interviewer and 
opinion-poll taker who code and edit responses to questions so that the 
responses obtained may be analyzed statistically on machines. The approach 
is also familiar to those who must define or specify large networks, such 
as critical path or project planning networks for a construction project In 
such project diagrams, events in time, or states, are numbered and the 
connecting activities arc designated by a pair of event numbers. Similarly, 
when a large hardware system is to be constructed and documented for 
later maintenance efforts, it is usual to code component parts or black 
boxes, and to code the connecting wires by color, number, or coded 
mechanical fittings. 

A general technique for mechanizing a complex system definition can 
be structured either from a block diagram or from a flow graph viewpoint, 
using classification techniques. For illustration we will consider the block 
diagram approach. 

Suppose we consider a system with many elements, or components, 
and several as yet undefined transaction types or relationships. (See Figure 
4-1.) The elements may be physical components, departments, machine 
groups, or blocks defined to suit the purpose of the investigator. 

A Data Collection Classification Procedure 

A systematic method of system definition can then proceed as follows: 

1. List, with an appropriate name, each element to be defined. 

2. Number, or otherwise uniquely code, each element so listed. A simple 
serial listing is often satisfactory here, but if later grouping of elements 
is contemplated, a partially blocked code (see Apendix C) may be used to 
identify similar elements. 

3. Develop categories for each transaction that may be encountered, 
and develop a code or check list for these attributes. Such a code will 
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**t Flow diagram ol a simple system. Indicating material and Inlormatlon flows. 
(Solid line, material. Dotted line, information). (Taken from 0. Herta and £ Eddlson. 
Eds., Progress In Operations Research, Wiley, New York, 1964, Vol. ». p. 130.) 


certainly contain a space for the “From" and “To" element codes and will 
also contain additional information that may be useful in later analyses. 

It may be desirable to include some space for a transaction (or report) 
name in some systems. One possible code type is shown in Figure 4-2. 

4. Construct, for each element, an clement sheet such as the one shown 
in Figure 4-3. The physical farm of the element sheet can range from 
simple cards to extensive dossiers, but regardless of the form, the element 
sheet should show the element name and number, and should provide 
space for the specified transaction data, which is arranged in a uniform 
format. The element sheet is usually divided into two parts: one records 
the structural information that relates incoming and outgoing transactions 
to the given elements; the other shows the detailed information on trans- 
formation procedures at the element and the detail of any files, storage, or 
delay that occurs at the element An instruction sheet, or manual, containing 
the method of data collection and the instructions for filling out the element 
sheets can then be provided to a team of investigators, together with 
assignments to investigate specific elements. The investigators will also 
need to bav e the coded clement dictionary or cross reference to fill out the 
transaction codes correctly. 
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5. Collect the information required on the element sheets. (For hard- 
ware systems, the required forms may be completed by several design 
groups or a team of engineers familiar with component specifications and 
connections. ) 


Data Assembly 

We may now put the element sheets together as follows: 

1. When they have been coded to show all transactions entering and 
leaving an element, the element sheets can be assembled by sorting the trans- 
action codes. Various system descriptions can be produced, depending upon 
the selection of transaction code categories. 

For example, for each clement sheet, consider first only those trans- 
actions reported as entering each element. This information can be extracted 
from each element sheet and a summary of all transactions created. For 
ease in later sorting, it may be convenient to create one IBM card or unit 


COLUMN NUMBER 

7 8 9 10 11 12 13 14 15 16 17 
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RATE RANGE 

TIMING 


TRANSACTION DETAIL 




FLOW FLOW 
TYPE TYPE 


FIRST SECOND 


PRECEDENCE 

requirement 


CRITICAL 
COMMITMENTS 

4-2 A transaction coding stricture. This is one yra, the transactions ol Figure 4-1 
can be coded. Tbe chart above illustrates the arrangement ol the code dibits with 
description and example given below. (Taken from D. Hertz and R. admE* 
Progress in Operations Research, Wiley, New York, 1954, Vot II, p. 131 ) 

1. Row typo. Col 1. Material 1. Information 2. 

2. From element, numbered according to diagram. Col. 2. 

3 To element- numbered according to diagram. Col. 3. 

4. Transaction rate between elements shown. Cot*. 4-6 (umt* pe, mona) _ 

5 Unit* of measure, transaction rate. Col. fl. receipt* order, 2. £££ £*?®***; , 

6. Timing of flow in day*: Col*. 10-11, range. Cot. 12. portt 3 - tfipment* 4 

7. Critical commitments: Col. 13, orders 1. approval ol orders 2. 


9, Precedence requirements. Col 1*. B 

Col 17. flow required (To be used if necessary, provision tor 
no precedence requirements I 
Note: In all case* x. xx, or xxx Indicate* blank. 


— ..,Mi red; Col. 16, arre 
*«t precedence only, bl* 
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Element Sheet 


mum to aid i™» »» •»««* «*«» “«• 

and nets any special detail below) — ■ ■ 

rr-rTTTTh TniL^^ 
rTTTTTZQIIII^^ 
i I l l M I T 1TTX IIXIX3 
nH-TTTTT Xm^^ 

Special Notes 

t Oescnbe uanstormaticn at this element 

2. Emergency input-outputs 7 

3. Translotmalion rates? 

* Critical decisions and commitments? 

5. Detail critical and normal operating restrictions 

6 Special and normal resource requirements 

7 Other notes. (Political, personality, quality, technical, or other 
problems here?) 


4*3 An element sheet lor organizing data collection. This sample can be simplified 
or expanded as needed, but it suggests a method of collecting and organizing data 
on an element so that the structure ot the system can be developed later, either 
manually or by machine Using element sheets ot this type, many people can work 
on system data collection. (Taken Uom 0. Hertz and R. Eddison, Eds., Progress in 
Operations Research, Wiley, New York, 1%4, Vot. II, p. 129.) 


record for each transaction or to place the transaction data on other 
similarly flexible machine-readable media. 

To illustrate, suppose that, by using the conventions of Figure 4-2, the 
18 transaction records of Figure 4-4 have been obtained from the field 
survey, (By that convention the transaction 1141001QU53x62xx means that 
a material flow occurred between element l and element 4; the average 
rate of flow is 100 receipts per month at element 4 with a range of 10; 
IS days is the average time of flow from 1 to 4 with a 3-day range; no 
critical commitment is involved in this transaction, but initiation of this 
flow required a flow from element 6, which is an order.) 

From these transaction data we can create many maps of the system 
by appropriate sorting procedures. Consider first the matrix display that 
tan be obtained from the first three digits of the 18 transaction codes as 
shown m Figure 4-4. The transactions in the figure were sorted in “To" 
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114100101 153x02a 187004004071x62a 236490402011x52a 265004 OOZOOOUxxx 
1 42050 1 04052x7 la 216100103031x62a 24Q1001 03011x11a 2760010G3000aax 
147100101011x11a 2235001 0203 lxxxa 253500403000232a 257004002000x6271 
174050 1040 10x62a 232500403031x52a 26U00l0203Uaa 
175004004010x82a 235500402010x22a 267490102011x32a 


4-4 Coded transactions. Note that there are 18 codes, one coded line tor each arrow 
In the diagram of Figure 4-1. Each line contains the detail required to analyze various 
system properties, according to the coding structure given in Figure 4-2. (Taken from 
D. Hertz and R. Eddison, Eds., Progress In Operations Research, Wiley, New York. 
1964, Vol. II, p. 130.) 


element order, and spread in “From" element order. The type of trans- 
action is entered in the appropriate From-To cell of the table, with (1) 
denoting material and (2) denoting information. From this matrix (Figure 
-1-0 ), we can draw an equivalent diagram of Figure 4-1 using the methods 
of the previous chapters, (If the transaction codes have been recorded 
on machine-readable media, the creation of these outputs can be mecha- 
nized. Similarly, other displays of system structure can be made by calling 
for other combinations of information. The procedure increases in value 
with a larger number of transactions and increased code complexity.) 

2. Variations in the form of system construction are available once the 
transaction codes have been summarized. For instance, if the data have 
been collected correctly, each To-From transaction should match identically 
one From-To transaction. For example, there should be exactly two trans- 
action codes of the form 114L00101153x62xx in the transaction list shown 
in the example because this transaction will be reported as leaving element 
1 on the element sheet (for element 1), and also reported as entering 
element 4 on the element sheet (for element 4). Thus, a check is available 
on the consistency and completeness of the data collection. Many other 
questions, for example, those shown in Table 4-1, can be answered about 


'^A^slmilar form of tabular comparison can be made from the data 
collected on element files and storage points. The contents of files at various 
elements can be compared for duplication and activity if the transaction 
code has been developed to show this detail. Processing load reports and 
cross references showing the elements affected by a given transaction are 

also available from the data collected. . . , 

With the basic structural information now accessible, the way is cleared 
for refinement of system definition along whatever lines the invesngators 
find necessary and a consistent background is available for later analysis. 


simulation , or improvement . , , 

3 In conclusion, several remarks may be useful. The procedure described 
could also proceed by first defining all transaction types of interest, and 
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From element number 




2 

3 

4 

5 

6 

_7 

8 

1 






2 



2 



2 

1 





3 


2 



2 




4 

, 






LL 

□ 

5 



2 







2 


2 

2 



2 






1 


2 


1,2 






2 

1 



From 



4-5 One of the many matrix displays that can be obtained from the coded transaction 
data. This table contains all the Information available from the flow diagram of 
Figure 4-t, and was obtained from the first three digits of the coded transactions, 
shown In Figure 4-4. The code 1 in the matrix indicates a material flow; the code 2 
Indicates an Information flow Note that In matrix displays of this type, the number of 
entries across a row Indicates the number of arrows coming Into an element; the num- 
ber of entries in a column indicates the number of arrows going out from an element 
The diagonal In this type of display Is usually blank, unless flows (or transitions) take 
place on a probability basis. (The matrix shown is the transpose of the transition 
matrix usually found In mathematical literature) Entries above the diagonal Indicate 
flows from higher- to lower-numbered elements; entries below the diagonal Indicate 
flows from lower- to higher-numbered elements, a property which may be useful In 
later analysis By using the display method described here, the code numbers as- 
signed to elements must be unique, but need not be In any particular order. 7 ho 
matrix display of data can be used In reverse to construct the How diagram. In 
particular, data collected from the element sheets, which result in one code per 
transaction, can be analyzed quickly for conversion to matrix form; for example, the 
second column of the code (Figure 4-4) Indicates the column of the matrix entry; the 
third column indicates the row of the matrix entry. By considering each transaction 
code at a time, the entries m the matrix are easily obtained. From the matrix, the 
flow diagram may be charted easily. Using this sorting procedure, element sheets 
from a number of observers can be combined mechanically. (Taken from D. Hertt and 
R. Eddison, Eds., Progress In Operations Research, Wiley. New York, 1964, Vol. U. 
P- 132.) 


by creating a transaction sheet. The procedure of data collection would 
then be to trace the required transaction types (or reports) throughout 
the system, recording each From-To element on the transaction sheet. This al- 
ternate approach (comparable to the flow graph approach of Chapt er 3) 




Table 4.1* 


Some Questions That Can Be Answered from the 
Information Contained in the Data Given in Figure 4-4 

1. Do any dements receive more information than they send? (For all transactions 
containing a 2 in Column 1, Search Columns 2, the “from” column, and 
Columns 3, the “to” column. If the frequency of entries for a particular ele- 
ment in Column 3 exceeds the number of entries m Column 2, that element 
receives more different flows than it sends Elements 6 and 7 are m this class , 
as reference to Figure 4-4 confirms. Frequently such elements are decision 
makers, or trouble spots 

2. Are there any elements that send out more information than they receive? 
(Reverse the procedure above ) Elements 1 and 3 are examples in Fig. 4-4. 
Frequently such dements are service units, or in an inferior control position. 

3. Which clement in this system has the greatest variety of inputs and outputs? 
(Scardi Columns 2 and 3 of the transaction code The element which appears 
most frequently in this search is the one required ) Element 0 here. 

4. Arc there any dements that deal only m information? (List all elements that 
appear in either Column 2 or Column 3 and have a 2 in Column 1. Next see 
if any of these numbers are duplicated in Column 2 or Column 3 of trans- 
actions having aim Column 1. If not, such dements deal only in information. 
Element 6 is such a case in Fig. 4-4. 

5. Do any elements deal only in material? (Reverse the procedure given in 4.) 

6. Which flows involve critical commitments? (Pull out the transactions with an 
entry other than x in Column 13.) Flows 5-3, 6-1, and 6-8 are examples in 
the data of Fig. 4-4. 

7. Is there a tightly coupled flow sequence in this flow chart? Yes, there are 
several if the precedence requirements are checked. (To obtain this result, it is 
convenient to trace backwards from an output to an input. For example, the 
flow from 4 to 2 is an output, but flow 4-2 requires one from 7 to 4, the 
flow 7-4 requires one from 6 to 7, 6-7 requires one from 3 to 6, etc., by 
inspection of the precedence requirements stated in the code of Fig. 4-4 Thus, 
the reverse of a sequence built up in this way is “tightly coupled,” or must 
follow in the given order. For the sequence illustrated the order is 2,3,5,3,6,7,4,2. 
Another such sequence is 2,3,5,3,2, or 6,1,4. It may be useful to distinguish 
“looping” sequences from those which are not looping. The nonloopuig sequence 
indicates a buffering in the system flow. Thus, the sequence 2,3,5,3,6,7,4A 
does not directly involve element 8, production, since the inventory element 7 
acts as a buffer under normal conditions. 

8. Where are the major information lags? (Scanning Columns 10 and 11 for 
transactions having a 2 in Column 1, we find that one to three da>s is the 
usual lag in information flow in this sjstem. However, if we adopt the con- 
vention that a delay of zero time should be checked further, we find that 
information flows 6-8 and 8-7 occur with only a small frequency compared 
to other transaction rates. This distinction can be used to indicate periodic 
reports, or, an alternate code can be used for this purpose. Thus, there are no 
major information lags in this system, but the frequency of periodic reporting 
and ordering may be of interest.) 

• Taken from Hertz and Eddison, Editors, Progress in Operations Research,, Wiley, New 

York, 1964, Volume II, pp. 140-141. 
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produces *o same result as the element sheet approach and, in addition, 
mav be more convenient in some applications. , „ • 

We should also note the advantage of having a method for defln,n ? a 
system quickly. By allowing many investigators to collect data at one time 
(rather than few tvorhng for a long tune), a "snapshot of the system can 
be obtained rather than the blurted results that could result if the system 
changes while the data collection is in progress. In many applications this 
is an important statistical advantage. 

The suggested procedure also yields economies in data collection and 
analysis which should not be overlooked. For these reasons a numberof 
automated procedures of the type illustrated have been developed. The 
best known of these is a procedure of data collection with computer 
analysis and display called AUTOS ATE (for Automatic Systems Analysis), 
developed by the RAND Corporation for use by the Air Force in defining 
and improving paperwork systems. The method is general, however, and 
may be applied to systems of all kinds.* 


CLASSIFICATION DEFINITIONS OF DYNAMIC SYSTEMS 

The method of the previous section is satisfactory for defining the static 
status of a system (or the average status if average values are collected). 
However, a method of definition is often needed that will define a system 
on a dynamic basis, for example, so that the system can be simulated. 
Simulation methods will be discussed further in Chapter 12, but one popu- 
lar method of system definition suitable for the purpose is appropriate 
here. We will consider the general class of systems that are discrete, or 
state-descriptive A particular characteristic of these systems is that they 
produce changes at specific intervals of time, rather than continuously. 

For such systems, an approach to system definition that uses lists of 
entities (elements or transactions) and their attributes allows us to produce 
a repetitive series of snapshots of system status leading to a “moving 
picture’ of system operation. In the following description we use the 
definition of lH ™ "*<• in SIMSCHIPT, a computer simulation language 
developed for the Air Force by the RAND Corporation. 4 

The SIMSCHIPT Approach 

SI MSCniPT 'icnsas^tc,. 1 as composed of entities that may bo permanent 
° temporary, and that hare attnbutes again permanent or temporary. Kor 

Wadnrenh. H.Jntoat'c.W L 1063 a chl^n's" W ftt Data r! >rDcestlns ond Drosreramfiig, 
Autoamrf D.U Smot. <S;,r¥ P r ?; 01 0 T - <=!>!“>. "AUTOSATE: An 

S»u Monied Trc! “' 1 "- < M ““ BM- 3 U 8 -PR) The BAND 

Ureter. A Simulation Programming 
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example, an employee, John joncs, may be considered a permanent entity 
if lie is on the regular payroll, or a temporary entity if he is a part-time 
worker. Jones has the permanent attribute “male" and the temporary 
attribute "age/* 

la addition to attributes, an entity may belong to a set or be the owner 
of a set of other entities. Thus, Jones may belong to the entity called 
Department 25, and Jones may own a set of entities, his personal hand 
tools if he is, say, a mechanic. 

At any time we can describe the status of the system by listing all 
existing permanent and temporary entities with their attributes, set member- 
ships, and set ownerships. 

A change in system status, as defined above, is called an event An 
event is said to occur, and n change in system status results, when one 
or tnare of the following changes occurs: 

1. The creation or destruction of an entity. 

2. A change in an entity attribute. 

3 . A change in entity set membership or ownership. 

Events may occur due to external pressures arising “outside" the defined 
system boundary, in which ease they are called exogenous events. Events 
may also occur due to the internal operations of the system and are then 
called endogenous events. 

To complete the system definition, the analyst must define completely 
the rules that cause a sequence of events to occur in simulated time. The 
development of the event-causing rules completes the model, or system 
definition, by relating one list of entities, attributes, or sets to another 
later in time. 

This listing or classification approach has the virtue that the rules of 
operation can be as complicated as necessary to produce realism-highly 
conditional rules, fixed or variable delays, random choice, and combinations 
of such rules are easily introduced by establishing additional lists and test 
procedures. 

When the starting status of the system and the rules for event creation 
have been established, the later operation of the system has been defined, 
and the rules can be followed from one event to another by a computer. 
The memory storage of an electronic computer abo permits ease of fist 
maintenance, because the memory media are fieri hie: List entries can be 
created, erased, and rearranged in microseconds. 

After "running” the defined system for a time that is sufficient to ac- 
cumulate a number of events, the characteristics of the system can be 
analysed statistically by reference to summaries of events, or the history of 
status changes. Moreover, the approach js perfectly general, so that the 
size and complexity of the system to be defined and simulated is limited 
only by the ingenuity and resources available to the analyst. 



, 00 Ctasifcation a nd M.rarcllfes In SyMtr, DellnUlon 

The reader will noUce that the SIMSCMPT approach to system defini- 
Uod rs no more than a mechanization of the classification nrcthods pre«; 
onsly discussed. The only detad of mechanization of additiona mtcrcst 
m this chapter is the concept of a temporary event notice list and an 
event timing routine. These ideas are illustrated in the following eaamplc. 
(Other forms of simulation timing will be discussed jn Chapter 1-). 


The Outline of a SIMSCRIPT-Type Problem 

Example. Consider a simple state-descriptive system consisting of a 
single work station that irregularly receives work and processes these 
fobs in shorter or longer times, depending upon the job to be done. 
(This is the classic queuing, or waiting line model.) 

To define the system by the classification method, we first define 
three permanent entities, two of which belong to the other: 


A. The system, 

1. The backlog or queue; and 

2. The work in process. 


These entities and their hierarchy define the lists of system operation 
that will be retained. Thus, the simple system is composed of the entities 
backlog and work in process. These entities will, in turn, own lesser 
temporary entities, i.e., jobs. 

We must now develop three types of event-creating rules. One type 
of event will be the arrival of orders (an exogenous event, because 
such orders enter the system). Another type is the endogenous event, 
which denotes a job completion. If a job arrives when work is in process, 
so that the incoming job cannot immediately enter the w ork-in-process 
category, the incoming job must join the backlog. Therefore, a third 
type of event can occur: The movement of a job from the backlog to 
the work-in-process category. Rules for the generation of each type of 
event are required. 


For example, suppose we have an exogenous event generator that 
causes orders to occur at succeeding intervals, according to some statis- 
tical distribution. Suppose we also have a job completion generator, 
which determines the time required to do a given job according to some 
statistical distribution. (The SIMSCRJPT computer language provides 
for these statistical operations, but they could be constructed according 
to e methods of Chapter 12.) Finally, we can specify the third type 
of event generation by stating two rules: 

1. If a job arrives and work is in process, assign Ibat job to the 
baton, of the backlog bsL Otherwise. g„ to work £ process. 

,k"° u P r° CeSS “ and a job is waiting in backlog, 

move tie ,ob at tbe top oi the backlog list to work in pLess. 
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If we assume we have these cv ent-creating devices, the system 
operation can now proceed. The simulation starts, say, at time zero, with 
both tlie backlog and w ork in process empty. The order arrival generator, 
however, tells us that an order will arrive at time 5. To control the 
progress of the simulation, we now create a tcmporanj event notice list , 
and note in tin's special list that an order will arrive at time 5. Because 
no other events are scheduled until then, our time recording device can 
jump to time 5, at which point the temporary entity, order I, is created 
and system status changes. 

Now we must perform several steps. First, we ask when the next 
order will arrive, and find from the order generator that one will arrive 
at time 15; we record this fact in the temporary event notice list. Next 
we apply the rules for the assignment of the incoming order, and assign 
order 1 to work in process. Finally, we ask when order 1 will be com- 
plete and determine from the order completion generator that it will 
be completed at time 19, this fact is also placed in the temporary event 
notice list, which is maintained in strict time sequence. Thus, the event 
notice for order Is completion will appear after the event notice for the 
next order arrival. 

Our clock may now jump ahead to time 15, at which time order 2, 
another temporary entity, is created. Again, we ask when the next order 
will arrive, and the input order generator gives time 23. By applying 
the rule for order assignment, order 2 goes to the backlog list, because 
work is still in process. System status now shows one order in backlog 
and one in process. 

Having completed the cycles to this point, the clock can now be 
moved to the next item on the temporary event notice list, time 19, the 
completion of order 1, presently in process. At this time the temporary 
entity order 1 is destroyed, because it now passes from the defined 
system, and order 2 is moved from backlog to work in process. Because 
a new job has arrived in process, we ask when the job will be complete 
and find from the job completion generator fame 27 for the completion 
of order 2. This future event is noted in the temporary event notice list 
in its appropriate time sequence. The simulation continues in this way, 
step by step, until the analyst stops the process. (See Figure 4-6.) 

In the above example, many statistics of system operation can be 
collected as events occur and the status of the system changes. The average 
backlog, the standard deviation of the backlog size, the maximum size of 
backlog, and the average time for an order to pass through the system are 
available from summaries of the data generated by the simulation process. 
Moreover, we can use any form of distribution to govern the input and job 
completion generators, and with only little added complexity of formula- 
tion, we can extend the simulation to series or parallel flows. 
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The reader will nobee that the SIMSCRIFT approach to system defini- 
bon .s no' more than a mechanization of the classiEcabon methods prevt- 
ously discussed. The only detail of mechanization of addition^ intere 
m this chapter is the concept of a temporary event notice list and Mi 
event timing routine. These ideas are illustrated m the following examp . 
(Other forms of simulation timing will be discussed in Chapter U). 


The Outline of a SIMSCRIPT-Type Problem 

Example. Consider a simple state-descriptive system consisting of a 
single work station that irregularly receives work and processes these 
jobs in shorter or longer times, depending upon the job to be done. 
(This is the classic queuing, or waiting line model.) 

To define the system by the classification method, we first define 
three permanent entities, two of which belong to the other: 

A. The system, 

1. The backlog or queue, and 

2. The work in process. 


These entities and their hierarchy define the lists of system operation 
that will be retained. Thus, the simple system is composed of the entities 
backlog and work in process. These entities will, in turn, own lesser 
temporary entibes, i.e., jobs. 

We must now develop three types of cveat-creating rules. One type 
of event will be the arrival of orders (an exogenous event, because 
such orders enter the system). Another type is the endogenous event, 
which denotes a job completion. If a job arrives when work is in process, 
so that the incoming job cannot immediately enter the work-in-process 
category, the incoming job must join the backlog. Therefore, a third 
type of event can occur: The movement of a job from the backlog to 
the work-in-process category. Rules for the generation of each type of 
event are required. 


For example, suppose we have an exogenous event generator that 
causes orders to occur at succeeding intervals, according to some statis- 
tical distribution. Suppose we also have a job completion generator, 
which determines the time required to do a given job according to some 
statistical distribution. (The SIMSCRIPT computer language provides 
for these statistical operations, but they could be constructed according 
to the methods of Chapter 12.) Finally, we can specify the third type 
ol event generation by stating two rules: 

I. If a job arrives and work is in process, assign that job to the 
bottom of the backlog list. Otherwise, go to work fa process. 

- It work in process is completed and a job is waiting in backlog, 
mosc the ,ob at the top oi the backlog list to work in process. 
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If wc assume we have these event-creating devices, the system 
operation can now proceed. The simulation starts, say, at time zero, with 
both the backlog and work m process empty. The order arrival generator, 
however, tells us that an order will arrive at time 5. To control the 
progress of the Simulation, we now create a temporary event notice list, 
and note in this special list that an order will arrive at time 5. Because 
no other events are scheduled until then, our time recording device can 
jump to time 5, at which point the temporary entity, order X, is created 
and system status changes. 

Now we must perform several steps. First, we ask when the next 
order will arrive, and find from the order generator that one will arrive 
at time 15; vve record this fact in the temporary event notice list. Next 
vve apply the rules for the assignment of the incoming order, and assign 
order 1 to work in process. Finally, vve ask when order 1 will be com- 
plete and determine from the order completion generator that it will 
be completed at time 19; this fact is also placed in the temporary event 
notice list, vvliich is maintained in strict time sequence. Thus, the event 
notice for order l's completion will appear after the event notice for the 
next order arrival. 

Our clock may now jump ahead to time 15, at which bme order 2, 
another temporary entity, is created. Again, we ask when the next order 
will arrive, and the iDput order generator gives time 23. By applying 
the rule for order assignment, order 2 goes to the backlog list, because 
work is still in process. System status now shows one order in backlog 
and one in process. 

Having completed the cycles to this point, the clock can now be 
moved to the next item on the temporary event notice list, time 19, the 
completion of order 1, presently in process. At this time the temporary 
entity order 1 is destroyed, because it now passes from the defined 
system, and order 2 is moved from backlog to work in process. Because 
a new job has arrived in process, we ask when the job will be complete 
and find from the job completion generator time 27 f or the completion 
of order 2. This future event is noted in the temporary event notice fist 
in its appropriate time sequence. The simulation continues in this way, 
step by step, until the analyst stops the process. (See Figure 4-6.) 

In the above example, many statistics of system operation can be 
collected as events occur and the status of the system changes. The average 
backlog, the standard deviation of the backlog size, the maximum size of 
backlog, and the average time for an order to pass through the system are 
available from summaries of the data generated by the simulation process. 
Moreover, we can use any form of distribution to govern the input and job 
completion generators, and with only little added complexity of formula- 
tion, we can extend the simulation to series or parallel flows. 
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TIME 

BACKLOG 

is PROCESS 

temporary event notice 

0 



Order 1 arrives at t = 5 

s 


Order 1 

Order 2 arrives at t ~ 15 

Order 1 complete at t s= 19 

15 

Order 2 

Order 1 

Order 1 complete at t = 19 
Order 3 arriv eS at t — 23 

19 


Order 2 

Order 3 arrives at t — 23 

Order 2 complete at t J= 27 


4-6 Classification method of system definition using SlMSCRIPT to: mat. This figure 
summarizes system status as described m the text. 


Note that by using the temporary event notice list, the simulation can 
plotted from one change in system status to another thereby eliminating 
the unnecessary repetition of computation and display that could result 
if system status reports were produced periodically. It should also be clear 
that the time scale used in the simulation can be changed to suit the 
analyst's needs. The SIMSCRIPT computer language provides for these 
adjustments and also prosides the analyst with numerous, convenient in- 
structions for handling a list creation and the timing routine illustrated above. 


PROBLEMS 


4 1 The following statements can be simplified, using logical classification, to 
give tl*e required answer First determine the important categories and the 
attributes of each category It may help to construct a small table showing 
all comb mations 

Here are the facts: 


1. There are Gv e houses. 

2. Tlie Englishman fives in the red house. 

3 Milk is drunk in the middle house. 

4. The Spaniard owns a dog. 

5. The Japanese smokes Parliament 

6. The Norwegian lives in the first house. 

7. Kools arc smoUJ in the louse Hex! lo ssherc Die horse U left. 

. The man who smokes Chesterfield lives next door to where the fox 
is kept. 

9. Tlie man w ho smokes Old Cold owns snails. 

10. The man wlio smokes Lucky Strike drinks orange juice. 

1 V Tr i firct ' x ‘ « immediately to tlie right of the ivory house- 
- kook 3X0 nun in the > cllow house. 
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13. Coffee is drunk by the man in the yellow house. 

1'h The Norwegian lives next door to the blue house. 

15. The Ukramon drinks tea 

10. Each man has a different house, drink, smoke, pet, and nationality 

A. Who drinks water? Who owns the zebra? (There is a unique answer for 
cadi.) 

B. Comment on your method of solution Did organizing the classifications 
and attributes help? Did trial-and-crror play a part in your solution? What 
other tcclunqucs did you use? 


(This problem assumes the reader is familiar with the material in Appen- 
dix C.) 

A manufacturer of men’s clothing has developed a code for men’s suits 
as follows: 


Number Sold Stock Number Customer Number Salesman 

in units size cut color style fabric region city serial 

XXXX 30 R 3 21 24 1 3 6 4 


1. Number sold: 

2. Size: 

3. Cut: 

4. Color: 

5. Style: 

6. Fabric: 


7. Region: 

8. City: 

9. Customer: 
10. Salesman: 


Units ordered (4 character positions for digits) 

Men’s suit size. ( 2 positions ) 

S small, R regular, L long, X extra long. (1 position) 

1 blue, 2 brown, 3 grey, 4 black; 5 other. (1 position) 
Manufacturer’s style number keyed to pattern. (2 posi- 
tions) 

Manufacturer’s fabric and price level code, first digit is 
price class, second digit is fabric code in price class. (2 
positions) 

1 northeast, 2 mid-Atlantic; 3 south; 4 north-central; 5 
mid-west; 6 west. (1 position) 

Serialized within region. (1 position) 

Serialized within city (1 position) 

1 Herman; 2 Cy, 3 Al, 4 Irving; 5 Walter; 6 Jack, etc. 
(1 position) 


A. Given an excerpt of the hypothetical transaction listing for this firm for a 
given month (see below): 

1. How many longs did Herman sell on the west coast? 

2. What is the distribution of shorts, regulars, longs, and extra longs by 
region? 

3. What is the most popular color this month? 

4. What should we send to Charles and Co , if the number of that store 
is 346? 

5. If the wholesale price of 34L21121 is $39.80, what is the dollar sales 
of this item? 

6. What region had the greatest unit sales? 
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B. «U comprranas have been made .n the development of the given 

^cleld you suggest a mote desirable code structure But would sattsfy 
both the need for interpretation and analysis and the need for code effici y 
in terms of compactness? 

D Why is A desirable to relate the statics by product code wth tne 
statrst J by customer code? Wbat other files should be organized around 
these two basic code formats? 


EXCERPT FROM TRANSACTION FILE 


100 32SU121 111 4 
200 32511114 321 4 
1000 32S12136 412 1 
200 32S24321 523 2 
1000 34R1H21 132 4 
75 34R22221 222 3 
250 341111314 146 4 
300 34R22330 452 7 
1000 34R33419 636 6 
200 34R1U19 171 4 
200 34L21121 382 5 
2000 34L32236 063 0 
1000 31X41119 117 4 


200 36S13221 221 3 
50 36R22121 340 5 
100 36R23119 346 5 
1000 36L11119 152 4 
1200 36L12236 441 7 
200 36X12634 631 1 
100 36X22221 663 1 
80 38R31323 542 2 
40 38L32323 511 2 
75 38X33240 512 2 
200 38X42236 513 2 
100 40L22119 221 3 
75 40X11221 646 1 


4,3 In the novel 1984, George Orwell referred to the "Office of Double-Think. 
Tim office had the function of updating the master files to suit the needs 
of die current administration. . . . When large-scale computer databanks 
are used for central distribution of information-through computer manipula- 
tions and electronic communication s> stems— vvliat precautions could be 
taken to prevent the outcome Orwell predicted? Give an example of a 
situation in uluch Orwell’s vision might come to pass. 


4 4 The Standard Industrial Classification Manual (bureau of die Budget) 
Government Printing Olfice, Washington 25, D.C., defines statistical cate- 
gories for U. S Industries. This 4-digit S.I.C. Code is arranged in a hierarchy. 
The first two digits indicate a major product group, the next digit a subclass 
within tliat group, and die fourth digit a sub-subclass. For example, Major 
Croup 35 includes Machinery, except heavy electrical. Code 357 indicates 
Office, Computing, and Accounting Machines. Code 3571 indicates Elec- 
tronic Computing Machines and other accounting and bookkeeping equip- 
ment, including cash registers The detailed contents of a given category 
axe described in die aforementioned manual which is revised from time to 
tunc as necessity dictates. Discuss die advantages of such a uniform system 
oi accounts and the possibility of difficulties to be encountered in uitcrpretmg 
daU reported on Bus basis, when used for historical or between-class 
comparisons. (Census figures, tax records, inter- and intra-industry trans- 
actions, geograplutal inventories of cstabhsluncnts and productive facilities. 
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to mention a few, are maintained by the government on an S.I.C. basis. The 
United Nations lias a similar classification for World Trade Statistics.) How 
do the disadvantages and advantages you have discussed relate to electronic 
computers and data-proccssing systems that use master files based on the 
S.I.C. codes? 


4.5 A village of 5000 inhabitants is considering improving its management. 
Better long-range planning lias been suggested as a major problem, and the 
mayor, who is also a real estate salesman on the side, has made up a partial 
list of some of the tasks his administration faces in management and control, 
He, however, does not have time to go into the detail presented by each 
of these problems, and seeks advice from you on the improvement of his 
administration. He has recently been elected and wants to show some 
progress in long-range planning by the next election, which is two years 
away. His list includes the following: 

1. Assessment of local taxes, tax billing and record-keeping. 

2. Handling zoning requests and building permits. 

3. Administration of the Water Department, including billing, main* 
tenance, construction, and repair. 

•1. Municipal payroll accounting, maintenance of pension and reserve 
funds. 

5. Management of street maintenance, repair, and lighting. 

6. Supervision of police activities and local court records. 

7. Maintenance of capital equipment records on buildings and equip- 
ment 

8. Planning, funding, staffing, and operatmg the grammar and high 
schools. 

9. Maintenance of dossiers on known undesirables and troublemakers in 
the community. 

10. Maintenance and follow-up of files on dog licenses, as required by 
the state. 

11. Maintenance of voter registration lists for local, county, state, and 
national elections. 

12. Maintenance of data pertaining to the Fire Department. 

13. Operations of the small but active Recreation Department. 

14. Other operations, such as snow removal, tree pruning, garbage and 
sanitation supervision, and the like. 

The mayor further explains that one of his major headaches is the projec- 
tion of the possible effects of trends in the community. For example, the 
State proposes to build a new superhighway, which will cut through a 
portion of the town, and a large builder has recently acquired a tract of 
land on which he proposes to build 50 homes in the $40,000 class. The 
mayor wonders how these developments will affect his tax roster, tax evalua- 
tion the schools, needed services, and his possibility for continued election. 

A cursory examination of the offices of this small government operation, 
which employs a total of less than fifty individuals, revealed that files are 
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kept on a number of the stated settees. but te “«l- “r e «ta ^ 
cumokte Many »erc also located u. separate locations. Hus ullage still 
has not automated most recortUceping tactions although the 
(by teal estate parcel) hove been reduced to machine medio for billing and 

accounting purposes. ... 

What form of study would you conduct to assist the mayor with 
problems, and what lands of questions would you ask in setting up your 
study? 


4.0 Bustness Week, May 14, I960, in "Information Becomes a Hot Item," PP- 
184-6Q, reported that International Business Machines Corp. liad just formed 
an Information Marketing Group. The function of this group is to sell data, 
and its first service will be the preparation of special industrial market 
reports This first service will combine Dun & Bradstrect’s file of 390,000 
U.S. businesses (95% of all manufacturing) and the U.S. Commerce De- 
partment's industrial input-output data, broken down into the full detail of 
4- digit S.LC. codes (see Problem 4-4). The customer gives IBM a list of 
its customers for specific products, sales or unit v olume, and sales territories. 
The computer searches the D & D databank and the government data and 
breaks out the client’s market potential and actual market share by purchas- 
ing industry, plant sue, territory and other precise categories. IBM's new 
service is paced at $3500 to §10,000 per report, a cost estimated to be Jess 
than that for accumulating conventional, less detailed data, 

A. Comment on the advantages m merging these two large databanks as 
indicated above. 

B. What are the advantages to the client and IBM of accumulating market- 
ing information from client requests? 

C Can you suggest any other large databanks that might be similarly 
merged. 

D. Under what conditions might it be necessary to have a fiduciary institu- 
tion control the information in a databank, rather than have the information 
controlled by a communication, computer, or other commercial firm? 


•4,7 (Although the following problem refers to a computer programing tech- 
nique, knowledge of computer programing is not necessary to understand 
the concept described. The reader unfamiliar with computer programing 
may consider the memory addresses mentioned in the following problem as 
numbered pigeon-holes in a post office, or sorting rack.) 

Some computer programs may require us to deal with lists of variable 
ength, which may be stored in various portions of the computer memory. 

3 T*** a p P n° ath ’ * e cxact iength of a fi> ven tet need not be 
Wn m advance. Following A. Newell (Ed.), 1 nfomethn /Wring 
Le„s»ge Manual, PeenUee-KaD, Englewood CMs. N. J, 1961. consider 
Vbe i duo wing arrangement; 

v.md'hS? !° em ° ry i i <M ' d ' MO a ” uml,et ° f four-character (fired 
d length) storage Warns, each oi which is numbered consecutive!)' 
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from 0001 upward, as shown in tire diagram below (tire italic type repre- 
sents Uie storage address). Information stored in this memory is then 
arranged in pairs of numbers, which, let us say, will consume two four- 
character storage locations. The first four-character position is used for re- 
cording the data part of the list, the second four-character position indicates 
the storage address of the data-item that follows in the desired listing. If, 
in addition, we adopt the convention that “0000” designates the last address 
on the list, and that we will have a special memory location which will 
always be used to indicate the location of a “free” data position, we will 
be able to record lists of varying lengths, and add to and delete from these 
lists without loss of memory spaces. Moreover, one list may contain other 
lists, and so on, the degree of interconnection between lists being determined 
by the "next-address” portion of the records stored. 

For example, see the tabulation below, which represents a series of data 
and next-address information stored as specified above. The list starting at 
address 0001 consists of the numbers 2233, 3111, <1010, and 0222, at which 
point the list terminates. 

Note in the table that the memory locations which are “free” for data 
storage, namely, 0007, 0009, 0011, 0013, 0015, 0017, 0019, and 0021 have 
already been provided with next-address links (The 0000 next-address of 
data location 0021 stored in address 0022 indicates that when 0021 has 
been used, the data locations available have been completely exhausted. 
This wall be so if there are no other open data locations.) 

Two-part word 


0001 

2233 

0002 

0023 

0003 

4010 

0004 

0005 

0005 

6222 

0006 

0000 

0007 

0008 

0009 

0009 

0010 

0011 

0011 

0012 

0013 

0013 

0014 

0015 

0015 

0016 

0017 

0017 

0018 

0019 

0019 

0020 

0021 

0021 

0022 

0000 

0023 

3111 

0024 

0003 



0007 


Next Available 
Address 
Reference 


Suppose we now want to add the number 5333 to list #1 starting in 0001. 
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The procedure for Air «o«ld Sf •» [nOn-n-s, 

1. The 0000 nextaddress of *e last data element i«W 
replaced by the 'ne.t-aia.laWe' empty address, Moral in 

taion tntated. To provide a nets 'neat-addreo.' tl.e 0003 nc.t-aWr . 
ninth is associated .nth neat data address (0007), U placed 
reserved next avada&fc address location. 

2. The data 5333 goes into the address location 0007 and 

link for address 0007 (namely the 000 9 presently stored to address 0005) is 


changed to 0000. indicating a new listing ending. 

3. To delete the data 3333 from the present list, the procedure above 
would be reversed, thereby mabng the address location 0007 available again 
for an alternate list. 


A Suppose, after studying the procedure above, uc discovered that memory 
locations 0007, 0009, and 0011 were already filled with a second list, Ust #2. 
and that the next available free address location wax 0013, as shown by the 
special memory box for that purpose We again want to add the data 5333 
to the list starting in memory box 000J . SVhat must we do to accomplish 


B- Having completed part (A) above. «c now want to add the data 7945 
to the bottom of list #1. suiting in address 0001. How would this be done? 

C. Suppose you wanted to add list #2, as described in part (A), to the 
top of hst #1. How would you accomplish this result? 

D. Why would llux procedure be a useful tool for defining the system opera- 
tion over a period of time? 

For further illustration, see the reference cited, ft contains a 31-itcm bibli- 
ography and several detailed applications. 


A manufacturing firm has a number of files containing information about em- 
ployees. A study of these files reveals much duplication in the following areas; 

1, Payroll file. Contains employees name, address, social security number, de- 
partment number, special pay record, number of dependents, major medical 
deduction, insurance deduction, annuity deduction, state tax deduction, FJCA 
record, pay rate class, eanungs to date this year, mail address for check, if 
different from residence. Blue Cross/Blue Shield deduction, 
cv. va th Imurai ! Ce file ‘ ^ntaisis employee’s name, address. Blue Cross/Clue 
Shield account number and status, monthly deduction, dependents by name, age, 
su, date of insurance contract lot each, changes to to.erage „ilh date and type. 
' , ,T t r oa ’ J " a °«« h. the Payroll Department,) 

addietf'nST’S- 1 ,1W M ' rfi ' al “I 14 0e: Con, atos employee's name, 

«* "tone, age, sex, date of coverage, and changes to 
numte employee's name, sex, age, address, contract 

. emp Intuition, tax option selected by employee, terms of con- 
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organization about a giv cn employee or group of employees, anti evaluate each 
proposal for Sic organization and each proposed code with respect to Its ability 
to answer the proposed list of questions. 

D In nulung such an c\al nation, would the importance of the questions that 
can and cannot be answered by the proposed system male any difference in tire 
evaluation? Would the frequency of the questions asked make a difference in the 
file organization and code evaluation 5 Arc there some portions of the file that 
should not be consolidated? What arc some of the advantages and disadvantages 
of consolidation of die suggested files? Would the problem of access to this con- 
solidated master file be important If so, what provisions could be taken to provide 
security where required? 

E Over a period of Ume, the employees listed in the master file may change, the 
details pertaining to each may change, and the questions that may be asked 
about an employee or group of employees may change. Have you provided for 
the updating of the master file in such a way tliat its over-all organization is not 
greatly altered? Would the file organization and coding system proposed be 
suitable for answering questions that are not on your original list of possibilities, 
but which might arise in the future? 

F Suggest some long-range planning questions tliat might be answered by use 
of such a consolidated master file. Suggest some short-range control applications 
of the master file. 



5 


CONTROL SYSTEM 

HIERARCHIES 

AND SYSTEM MEMORIES 


In the previous chapters we discussed some methods for defining system 
structure. These methods showed the relationship between operations and 
developed techniques for predicting system output, given system input. 
We indicated that such prediction became more difficult for the observer as 
tile “memory function" of the system became more extensive, because the 
system could then take diverse actions in response to input conditions. 

T7iis chapter elaborates upon control hierarchies so that a scheme for 
viewing large-scale systems will be available to us. We now consider some 
general system properties, which determine the “level” at which a system can 
predict and control. We will observe the system in this chapter from the de- 
signers viewpoint, i.e., in terms of the system’s purpose, which is known to us. 

FIRST-ORDER PREDICTION AND CONTROL 

The simplest system that can exert control is the simple feedback arrange- 
ment shown in Figure 2-10. Here, in addition to a given transformation, the 
system measures its output against a desired input and corrects for the 
immediately observed difference. No memory function is involved. 

For example, the operation of a household furnace alone is to transform 
fuel into heat. When the furnace is turned on, the heat starts and continues 
until the furnace is turned off. The addition of a thermostat, however, intro- 
duces first-order feedback. The homeowner sets a desired temperature, and 
the thermostat, a regulator, turns the heating system on and off at tempera- 
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tares above and below the desired setting, thus giving an average tempera- 
ture near the set value. (A more expensive continuous controller could be 
achieved by increasing or decreasing the fuel flow to the furnace m response 
to temperature.) The first-order feedback system has three parts: A receptor 
{or sensor } to receive the temperature reading in the house in this cose, 
a detector ( or summing point) to measure the difference b tween the receptor 
measurement and an input standard, and an effector (the furnace) to cause 
the change required. The response oE this simple system will vary depending 
on the characteristics of the receptor, effector, and input standard ( e -6-> 
the furnace with a small boiler and piping system will have a quicker 
response than one with larger components), but a direct relationship can 
be calculated between these factors. The heating system is a complete slave 


to its design and input setting. 

A control system, even at the first-order level, has the property of protect- 
ing the internal system serviced (the house) from external changes in the 
environment. Thus, the function of the heating system is not merely to turn 
the heater on and off, but to perform this act so that the householder is 
unaware of external temperature changes. Generally, this control system 
function and performance is so. The "perfect" control system would perform 
the job and eliminate the problems of environmental vagaries. And we would 
ne\er be aware that the system was working. 

Another example may clarify the point. Suppose we had to mill blocks 
of steel to a given thickness. One way we might accomplish this would be 
to set the cutter, start cutting, and not make any adjustments for the tool 
wear or other environmental changes until some later inspection step indi- 
cates something has gone wrong. First-order feedback can be introduced by 
placing a sensor just after the cutting head and measuring the cut just made. 
Any error so detected is fed back to an effector which alters the cutting 
bead to the correct value. Again, because the purpose of this system is to 
Cut blocks of a given thickness, the introduction of feedback protects the 
system purpose from changes in the environment. 

The reader can easily suggest other illustrations of first-order feedback. 
In first-onler feedback no explicit "memory** blocks appear. The required 
memory to correct error into direct action is built into the system structure. 
The function of first-order feedback is usually to hold a system at an 
equilibrium condition, 1 or to provide a rudimentary “following” or "track- 
ing ability. 

The first-order system cannot predict or make a conditional choice of 
actions. 


’ t ' od “ ra tqvibrtmn u ollm calM a -bommuT 
1 ? U .operation !l taU «l Historically, \V B. Cannon 

suggested twi te- to hi* classic WUdom of the Body (Norton New Y OI k! 1032} The 

of ’ “n* b r° d pr . eSsme are a few c “f’P lc * 

flucacnl the development of csbcmJw b °*J y ,Y Dr ; , C *? nOD * Presentation greatly in- 
ful system*. cjbemettes and the interdisciplinary design of purpose- 
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Now, let us introduce some memory into our system. In the thermostat 
example, suppose \vc want to change the temperature setting at different 
times of day. These settings can be stored by mechanical, electrical, or other 
mcans-they can then be selected at die appropriate time by a clock. 

Or, suppose we introduce other sensors: an outside thermometer and a 
wind-speed indicator. By offering it added information, these sensors enable 
the heating system to improve its operation by anticipating future heating 
requirements— Lc., if the system can “remember” the heat-loss formulas for 
a house under wind velocities and differential inside-outside temperatures— 
and by evaluating the available data for action. 

This second-order, commercially available heating control system is obvi- 
ously more complicated and more expensive than the simple thermostat. 
It contains not only a receptor (or more) and an effector (or more), but also 
memory storage equipment, computing or more complex transformation 
equipment, and coordinating ability to determine the necessary course of 
action, based on the input information. (See Figure 5-1.) If its components 
are properly balanced, the operation of this second-order system will further 
reduce the error between system purpose and system results. The essential 
ingredient of the second-order system is its ability to initiate various actions 
in response to changes in input conditions and to separate its actions from 
an immediately slavish “following” to some measure of self-prediction and 
control. Here the timing of response is a critical factor in determining the 
success of the system. 

Example: To illustrate the use of memory and prediction, consider the 
marketing department of a firm that has developed sales plans A, B, C, 
and V to use under specified changes in the market, anticipated as con- 
ditions 1, 2, 3, and 4. The optimal choice of plan for each condition is 



5-1 A higher order control system. This figure Illustrates that a memoiy and control 
device can be used to give a receptor and effector increased ability. If conditions 
change, the memory and control device can prov.de alternate plans or direct the 
effector to anticipate the action required to maintain control. In advanced systems the 
memory and control device can also direct the receptor to be aware of alternate 
problem areas. 





114 Control System Hierarchies and System Memories 


shown in Figure 5-2 by an “X" and the "decision rule” for making the 
hest choice can be predetermined and built into the system. 

If the firm has the ability to shift its operation from plan to plan 
at once, then, when conditions change, the optimal series of plans (ie., 
an optimal policy) can be followed automatically. 

Suppose, however, that some time is needed to change from one plan 
to another. Then, in addition to a variety of reserve plans and corres- 
ponding decision units, the marketing department must anticipate far 
enough in advance when conditions will change so that the appropriate 
plan can be put in effect before conditions change. The optimal policy 
can be followed only if the system can predict. 

The classic example of the second-order control system is the “fire 
control” or hunting problem. If a hunter hopes to shoot a duck in the 
wing, he will not aim at the bird’s present position— he will "lead the 
target.” Many factors will determine where and how much to lead- 
target distance and direction, target velocity, wind, ballistic factors, 
observations of present target evasions, and often the type of duck (coots 
fly differently from mallards). In this example the appropriate use of 
prediction and control is easily tested by success in hitting the target. 
The hunter will be more successful, in general, as his speed at evalua- 
tion and action increases, as his memory of alternate courses of action 
increases, and as his ability and speed to correct for errors increases.* 


In each of the examples above, we would say that the second-order sys- 
tems described showed more “self-control” than the first-order systems of 
the previous section, i.e., second-order systems are more "autonomous.” 
The second-order system is not a complete slave. It makes some decisions— 
based upon information and decision rules provided to the system by the 
designer— by itself. It can handle a greater variety of input conditions. 

However, the step to the second-order system also presents additional 
problems of system design and system “optimization.” For example, to 
achieve a given purpose, how much memory search and evaluation will be 
iwed before decision and action? Too much search and evaluation may slow 
the system response beyond usefulness; too little search and evaluation, 
however, may cause the system to commit gross errors. How should the 
economics of system operation be allocated, say between size of memory 
and speed of computing components, or between the precision of receptor 


offered rtoic proof of 

SjhwSEU, ” ” v™ ™ ' 01 »“> »<“Wt fire 

arfh. Wheel free, „ 5 ®* 6st, “ P>»e p.lotr 

«5ge. because ddferent evasions could be fi t ?“ ***,!- 

rtonng typical evasions by sdmol ard *. d f erences m instruction. By 

“ fund (to predict iota. f *“«•< 

drastically improved. 8 P 0551 ' 00 *) the number cf successful shots was 
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Plan to use 



5-2 The simplest memory ol alternate plans. A change in input conditions initiate: 
an alternate mode ot operation as shown. With advanced systems, a wide range o 
plans, or combinations of plans, would be available to meet a wide range of alternate 
conditions. Compare the rigid plan selection above to the coin-changer example o 
Chapter 2. The wider range of aJternabves available with a greater range of alterna 
lives stored in memory will be discussed more fully m Chapter 6. 

and effector components as opposed to the duplication of components re- 
quired for reliability and maintainability? 

Thus, the second-order and higher-order systems, which offer great 
benefits, also provide the designer and user with major problems not only 
of analysis, but also of choice, adjustment, and value. And these problems 
become more severe as the system becomes more complex. 


MEMORY CHARACTERISTICS IN SYSTEMS 

We must now digress and discuss some of the detailed properties of the 
memory function in systems. This discussion will then permit us to introduce 
the notion of yet higher-order systems into our hierarchy of system types. 

Symbols and Abstractions 

Although occasionally we can store actual physical items for later use 
(special tools, cash, inventory personnel), memory usually implies the stor- 
age of abstractions, or symbols (a tool number, a bank balance, a stock sta- 
tus, a list of employees ). 

Sometimes we store symbols because it is convenient, other times wc 
store symbols because another form of storage is impossible. Thus, a road 
map is an abstraction used for storage because we cannot keep the entire 
countryside in our pocket. Our abstractions differ depending on our pur- 
poses: The map suitable for the private car owner is unsuitable for the tax 
collector or the census taker. 
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Id particular as we move to higher-order systems and the components 
removed from receptors and effectors, we find that the operations wo must 
perform are usually upon symbols. 

The corporate manager seldom performs operations on a physical product 
-mostly he concerns himself with abstractions at various levels of dejai): 
balance sheets, cost statements, engineering reports, correspondence^ con* 
cepts’ of management, abstract values such as “opportunity cost, and 
social and philosophical considerations, such as community welfare and 
the development of alternate goals and values. 

For this reason, some principles of abstraction are useful when we dis- 
cuss memory. Here we restrict our attention to a few relations between 
symbol formation and memory ability. 

Principle Ho. 1: Symbols used m memory can ba expressed In many equivalent yet 
unique terms provided a one-to-one translation Is possible from one set of symbols 
to another. 

Example. Customer Jones can be called customer l£), or ABC, or (01, 
10, 11), etc., and a translation between these formats is possible pro- 
vided the symbols are uniquely related, e.g., 

1 A 01 

2 -* B 10 

Principle No. 2: The most convenient choice of symbols is determined by at least 
lour tactors. including (al the memory media, tb) the operations to be performed on 
the symbols, (c) the slia ot the memory facilities, and (d) the required speed ol 
access and retrieval 


Example; The ultimate storage of symbols in an electronic computer is 
in the form of electronic charges or magnetized components that can 
have only two states, "on-off," or "0-1 This "binary" characteristic of the 
stotase media requires that the inioimation to be stored be translated 
mto (and lot retrieval from) such binary numbers. Using a "pure 
binary code, we could represent ft! distinct numbers from 2 ero to 63 
with five binary positions by counting in powers of two. The number 37, 
lor example, could be represented as 

l X 2* -V- 0 X 2* 4- 0 X 2» + 1 X 2* + 0 X 2* + 1 X 2 a 

= 32 4-4 + 1 = 37 . 

^■“ 1 *'““'*' by the positions of the powers of two in the 

Iraoi Tsw, r,? at btoMy latent of decimal 37 is 

1 10!. (Similarly, the pure binary equivalent of decimal 83 is 1111!, 
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etc.) Pure binary is the most efficient form of encodement for the 
computer when numbers are used exclusively, as is often true in scientific 
computation. 

However, when business data are processed, we use many alphabetic 
and special characters. For such applications, an alternative code form 
is often used. Six binary positions are reserved for each numeric and 
alphabetic character. Four of these six positions, or "bits,” provide six- 
teen possible combinations, or codes, by using the powers of two from 
2 s through 2° (the binary equivalent of 8-4-2-1). The remaining two bits 
(often called B and A bits) identify four hlocks, or zones, each with six- 
teen possibilities provided by a repetition of the four-bit 8-4-2-1 detail, 
as illustrated in Figure 5-3. 
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5-3 A six-bit coded decimal conversion chart (IBM 1401). The table shows (to the 
extreme right) the 8 -4-2-1 binary detail associated with each character m a given line. 
Two additional zone bits, the same for each of 16 characters in a zone block, are 
indicated in four columns. In combination the zone and detail bits provide 64 charac- 
ters In computer memory (first column each zone as defined tor the IBM 1401), of 
which only 50 are usually available for printing (printed symbol shown in second 
column each zone). The special, nonprinted characters are used for internal control 
purposes. A seventh, check or parity, bit used in most computers is not shown. It is 
important to understand that the code “11 0001” shown as “A” does not necessarily 
mean "A" to the computer, but is assigned that value by people using this particular 
equipment. The code "11 0001” — as well as the other 63 codes shown — could just as 
easily stand for any other character in any 64-character set; the desired character is 
placed on the input keyboard and output printer in a position that provides consistent 
translation. 
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By reference to Figure 5-3, the decimal number 3 is given by the 
binary coded decrmal form (BCD code) 00 0011, and the deernra 
number 7 is given by 00 0111. Thus, in BCD code, the number 37 is 
given by the 12-bit combination 00 0011, 00 0111. (The same decunal 
number, 37, coded in pure binary would have consumed only 5 positions 
in computer memory, as previously shown.) 

Nevertheless, the BCD code, as shown in Figure 5-3, allows alphabetic 
characters, such as the letter “A," to be coded with only 8 positions, 
namely 11 0001, where the first two positions to the left are the “zone 
bits" and the four to the right are the 8— 1-2-1 binary detail. Similarly, 
“B" would be 11 0010, and “C” would be 11 0011, and so on. If all the 
characters in a 64-item character set are used with equal frequency— an 
ideal approximated by the requirements of commercial data processing— 
the su-positions-per-character of BCD code approaches the efficiency 
of the pure binary scheme for numeric data. The BCD code also permits 
efficient translation of alphanumeric data from standard punch cards 
to internal computer storage and back again to printed output using sim- 
plified electronic circuits. Ease in handling alphanumeric data makes the 
BCD code format preferred for business data handling. Thus, a change 
in code format may be desirable to match storage facilities to the most 
common input/output or common computational requirements. 
Although we used a computer example here, the problem of symbol 
choice and translation is general, and also applies to other media. 


Principle No. 3: ll part ot a symbol can ba duplicated or reproduced by reference 
to another symbol (or set of symbols) stored in memory, that portion of the new symbol 
is redundant 


Example. The classic example is the structure of language. Typical 
printed English is about 302 redundant in the sense that the reader 
can fill in the missing letters (or words) of many messages from his 
knowledge of the language structure. Most people will recognize 
CNURD THS TXT? 

as Can^you read this text?" (This fill-in ability is the basis of the “Speed- 
wnting method of shorthand notation.) 

Similarly, if the details of “Plan A” are stored in memory, the instruc- 
tion Use Plan AT or simply “A" need not be followed by a repetition of 
“* , , ■ tie user understands that reference to the stored plan is 
required. K 


I*" ' 1 "* "°- 11 Tho a “™“' 01 '"'ormanon contamed In a symbol can be measured 

mTst be asked", Tam ° 01 " mM1 >™-no" Cues.lons dial 

must be asked to detenu, na a missing element In a message 
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In this questioning process, the minimum number of questions required 
will depend upon the content, size, and organization of a system’s memory, 
and the method of symbolization used. 

Example: Suppose one contestant is allowed to select (at random) one 
of the 32 characters in Figure 5-4, and a second contestant is to find the 
selected character by asking the first contestant the minimum number 
of questions that can be answered yes or no. 


A 

D 

c 

D 

E 

F 

G 

H 

I 

J 

Iv 

L 

M 

N 
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P 

Q 

U 

s 

T 

U 
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W 

X 

Y 

Z 


r 


; 

! 

t 


5-4 A table of characters for choice game 


The least number of questions that can be asked in this game is five’ 
(if the first contestant selects his character at random) and the most 
efficient sequence of questions is “Is the character in the upper 16?” 
"... in the right-hand eight?” ”... in the left-hand four?” etc. 

Because five questions are required to isolate the randomly chosen 
character from the set of 32, knowledge of the character is worth "five 
bits” of information, if we use the questioning measure of information 
content* 

Using the questioning measure, the reader may wish to verify that the 
amount of information for a five-letter group of characters randomly chosen 
(with repetition allowed) from the above table is 25. However, the amount 
of information contained in the five-letter English word QUIET is less (for 
the Englishman), because in English Q is always followed by a U. No 
guesses would be required to fill in this second character, if we knew the 
first was Q. 

Generally, then, the amount of information received in a message (or 
stored in one part of a memory) depends upon the amount and kind of 
information already stored in a systems memory and is measured by the 


* Generally, for the N-character game using random selection, the minimum number of 
questions is log, N. Note that 32 — 2 3 m this example. The most efficient questioning 
procedure-that which divides the universe in half at each question-is familiar to the 
contestant in the game of "twenty questions ” With twenty most efficiently asked ques- 
tions a contestant could isolate one item in a list of 2* — 1,048,576. The analyst often 
uses the binary questioning process to reduce system variety, as described m Chapter 11 
‘This is the measure of mathematical information theory, developed by C. Shannon. 
(See Appendix D.) 
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total number of questions that must be asked to duplicate the message. Any 
knowledge of structure of the message, any reference to previously stored 
information, or any knowledge of the variety of the incoming information 
can reduce the minimum number of questions that must be asked. 

To take the extreme case, suppose our contestant with the 32 character 
example is Jones, and by reference to our memory we know that Jones 
never picks any character except "A”, that is, Jones never plays at random. 
Then we need ask no questions of Jones to isolate his choice. Playing the 
game with him is totally redundant. It is for this reason that we find it easy 
to talk to persons who have a well-stocked and well-organized memory. 
They seem to anticipate what we are about to say. Indeed, from a mathe- 
matical viewpoint, any communication with an omniscient person is re- 


dundant. 

These principles and examples (in addition to the discussion of codes 
and information structure in the previous chapters) teach us much about 
the definition and design of higher-order systems. 

First, the higher-order systems will have a memory, and it will deal with 
symbols. 

Second, these symbols can be chosen in many formats and translated 
from one format to another as the need arises. 

Third, the higher-order system will use its memory (in combination with 
input information) to make predictions, and this predictive ability will re- 
quire combinations of input and stored information, and operations upon 
these combinations. 


Fourth, the number of system operations required for prediction, decision, 
and action, and the speed with which they can be performed, will depend 
upon the variety of nonredundant information that must be processed per 
unit time (as measured by the questioning process we just described). 

Fifth, the higher-order system will have the capacity to handle a much 
greater variety of information per unit time than the variety required for 
immediate control. This ability will be gained not only by the speed of 
operation, but also by the storage of data and many alternate patterns to be 
used tor swift translation, prediction, and the control of its own internal 
operations. 

In summary, the effective, efficient ability to store, retrieve, manipulate, 
fln s ^ m kol s is die initial step in building the higher-order system. 

We have seen that higher-order systems typically have large, fast, well- 
organized memories, and that this asset provides increased control ability. 
w e now study the organization of memory and the types of memory media. 


Memory Media 

c ^ racterisl |c of a powerful control system is the 


ti™ „T ' 7 “ I'vwenm control system is me 

w and type of memory medmm used. Memory size, usually measured by 
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the number of character storage possibilities, influences the number of 
different types of information handled at one instant of time and sets an 
upper limit to the variety of choices a system exhibits at a given time. 

The type of medium used for storage usually determines the speed with 
which information can be stored and retrieved. One extreme is the com- 
parison of data stored on a scroll with data stored in a card file. To retrieve 
information from a scroll, a sequential search pattern must be followed; 
however, in the card file the desired data can be located directly if the 
file organization is known. The same comparison can be made between a 
tape and a record, or between an electronic computer tape and its magnetic 
core storage. The song on a record or data stored in a core memory ran be 
"addressed directly”; the tapes however, must be searched sequentially, even 
when the file organization is known.* 

The speed of memory access and retrieval is often related to memory 
cost per character, as is memory flexibility, i.e., the ease with which charac- 
ters can be erased, written, read, or moved from one position to another. 
Thus, the magnetic core memory used in computers is the fastest and most 
flexible memory media available commercially at the present time, but it is 
also the most expensive per character. 

For this reason, in a complex system the physical media of storage are 
commonly arranged in a hierarchy of types, reserving the most flexible, 
fastest memory for information currently in demand. The mass, infrequently 
used information is stored on slower, less expensive media, such as magnetic 
tape. Thus, a modern system, such as the IBM 360 series of equipment, may 
employ memory functions ranging from magnetic cores operating in mil- 
lionths of a second (microsecond) or even thousandths of a microsecond 
(nanosecond) to a reserve library of tapes and cards that must be found 
by a human and carried manually to the system. 

We have used computer examples here, because of current interest in 
the subject. The reader should also realize that any medium capable of stor- 
ing characters or “signals” for any period of time can also be used for 
memory. Symbols may be written on stone, paper, or in the sand; stored 
temporarily on the face of a television tube; held momentarily as an accoustic 
echo in a tube of mercury, or as an electronic pulse in a wire “delay line”; 
held in a magnetic arrangement or in a collection of colored cards; fixed 
in the position of electrical relays, or the chemical structure of a molecule, 
or the genetic code of our genes; or stored in the human mind. 

The wide range of memory media available and the trade-offs possible 
between cost, speed, size, permanence, and flexibility provide the designer 
with a wealth of alternatives in the design of higher-order systems. 


1 For a survey of computer memory devices, see: J. A. Rajchman, "Computer Memories: 
A Survey of the State of the Art,” Pros. Inst. Radio Engineers, 44, 104, January, 1961. 
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Memory Organization 

We have seen that complex systems usually employ a variety of memory 
media that can be ranked into a hierarchy of types. The same is true of 
memory organization, where the hierarchy is set up according to the level 
or pervasiveness of the information stored, when used for system control. 
In general, the higher level systems have the ability to exert longer range, 
or self-adaptive control than the simpler systems. This fact will become 
evident when we discuss the types of memory storage. 


Types of memory storage 


We may divide memory storage into three major parts for discussion. 

1 Data or Analytic Storage, in which the information used in later opera- 
tions is held ( typically in a highly structured or segmented form, so that any 
desired part of the data elements can be retrieved at will). 

2 Routine Pattern Storage, in which the rules for translating and combin- 
ing data, ».e., the steps of a procedure of computation, are held. 

3 Novelty, Learning, or Goal Changing Storage, which, in the higher- 
order systems, provides rules for revising routine pattern storage, for chang- 
ing Uie priority of operations, or for causing a change in the memory content 
or of objectives of the system. 

The following computer examples will illustrate the distinctions above. 


Example: The distinction between data and patterns of operation must 
be clear before we proceed, because this distinction is essential to un- 
derstanding electronic computer operation. If the steps of a procedure 

Led?!? 1 ", Tl b rr° Storcd 3 memD,y “d executed with electronic 
Tthe ^ effectiveness of the machine is greatly multiplied. This 
totoectT b ' twc ' n ' h = rafcalator, which requires an operator 
^’s”, the having a stored pro- 

tLS “’ “ aUe '° P ' rf ™ re P cli,iv = operations on its osvn. 
S2& Pt08nUn “ ae “ SCntial ‘W*- ° f «*» electronic 

cmS St ^3 £££>’££?, s “ se ' 

is translated from French' ,o Ealfftt ‘f'T A ^ 

JS written, converted bv interrJL- * , P 01 “ con5taat - A message 
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History and technology and their transmission to succeeding generations 
is a transmission of patterns as Well as facts. Both are necessary. But, the 
recognition of patterns, the permanence or durability of pattern storage 
and translation of patterns without loss of structure are the essential 
ingredients of society, industry, science, and even personal sanity. The 
pattern, the structured relationship, is the essential ingredient in any 
system. Without a pattern, facts have no meaning, and no operation 
prediction, or learning is possible. 


In this example, the schematic memory box of Figure 5-5 shows two 
parts: on the left a series of numbers in lettered boxes (the data), and 
on the right a group o£ instructions, also in lettered boxes (the pattern 
of operation, or program). 


The pattern of operation on the right shows what is to be done to 
the contents of the memory boxes. The operations themselves do not 
depend on the contents of the data boxes, but on the name of the boxes. 
That is, if we changed the numerical values of the data stored in the 
various data boxes, we would get a different numerical result, but the 
pattern of operation, or steps in the computation, would be unchanged. 

To understand this distinction, we adopt the convention that the 
instruction “Add A to G" means “Add the contents of box A to the con- 
tents of box G and store the result in box C.” We also adopt the conven- 
tion that the system receives as its first instruction the contents of box H, 
and that it will proceed with the instruction in the subsequent adjacent 
alphabetic box unless the sequence of operations is modified by an 
instruction in the sequence. With these conventions, the reader may 
verify that the first number printed by the system shown will be 


( [(4 -h 5) X4] +41X4 « 160 


Data 


Instructions 


Memory location 

Contents 

Memory location 

Contents 
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Add A to G 
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Print G 
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M 

Add C to A 

G 

5 

N 

Go to H 


5-5 The relation between data and instructions (or pattern storage) m a system 
memory. 
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If the data were changed in Figure 5-5, the sequence of operations 
might be extended or repeated several times, but the operations them- 
seUes would not be altered. (For example, what would be the first nub- 
her printed if the contents of box A were changed to 10? Is the sequence 
of operations modified by this data change?) 

Note that by extending the memory size {the number of memory 
boxes available in Figure 5-5 for data and/or instructions) the variety 
of data and patterns of operation can be increased. Moreover, the inter- 
connections that can be employed between patterns increases at a geo- 
metric rate. 

To show this, consider the above system if we introduce "conditions" 
and “imperative" instructions, i.e., pairs of “IF* and “GO TO” instruc- 
tions. With this addition, the system can use its memory to switch plans 
of operation depending upon input data, or the results of intermediate 
computations, such as predictions. 

Example. Figure 5-6 shows a duplicate of Figure 5-5 with the program 
steps altered to provide a selection of plans, depending on the data 
contents of box A, e.g., we follow the plan stored in box L if the contents 
of A is 1, etc. Again, if we assume the system starts with the instruction 
in box II, the system will print 20, or 13, or 5, if the data stored in 
box A is 1, or 2, or 3, respectively. (What operation will the system 
perform if the contents of box A is net 1, or 2, or 3? The reader may 
verify that the system will print the results of applying the sequence 
of instructions, K, L, and If, or 23.) 


Instruction* 


Memory location 

Contents 

Memory location Contents 

A 

? 

H 

If A is 1, Co to L 



I 

If A is 2, Co to M 



J 

11 A i, 3, Co to N 



K 

Add B to G 



L 

Add F to G 



M 

Add C to G 


* 

N 

Pont C 



0 

Set C = 5 



P 

Read New Value of A 



Q 

Co toll 


cha *a« tha pattern ot system opera- 
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Note the important moral of these examples. The operation, or trans- 
formation, of a system with memory can be changed not only by a change in 
the data entered or stored in memory, but also by a change in the stored 
patterns of operation in the "program” portion of the memory, and especially 
by the interconnections that conditionally alter the selection of operating 
procedures. 

Thus, with a properly organized memory, the system can not only per- 
form repetitive manipulations of one kind, but can also take diverse actions, 
depending upon (l) the input information observed, (2) the data already 
stored, and (3) the pattern of program steps stored. 

The second-order systems described in the section on second-order feed- 
back imply a memory organization like that described above. And, with 
such an organization of information, the system can perform a range of 
operations limited only by (1) the size and speed of the memory and 
adjacent processing equipment and (2) the ingenuity of the memory or- 
ganization designer (the programer of computer “software,” to use a data 
processing term). Note, however, that the system must rely (at this level) 
upon the designer's rules. It cannot create or change its own rules without 
further complication of memory organization. 

Thus, we come to the third general type of memory storage-the storage 
section that concerns the modification of existing rules and patterns, the 
development of new plans, or the acquisition of new data. We consider 
problems of invention, novelty, value, priority, allocation of resources, 
adaption, and goal change. 

Example: Suppose the objective of the system shown in Figure 5-0 is to 
obtain an average output as near to 10 as possible for any sequence of 
inputs 1, 2, 3, . . . at box A. Suppose the inputs to A are 1, 2, 3, 1, 2, 3, . . . 
and the steps in the previous example are repeated with each change 
in input. The result would be an output of 20, 13, 5, 20, 13, 5, ... , 
yielding an average of or 12.67. This average is larger than our 
desired output (for the given input sequence). How could the operation 
-psAtem Sot *Avz. ■gwe.v. systesa altered, to xsjviK the. dawned ohyjGfcLve? 

The answer to this question would require analysis and correction 
at a level removed one step from the routine computation. We ( or the 
system) would have to scan the input data, store some of it, and deter- 
mine if a pattern were present. We would also have to scan the data 
now in memory and the present operating rules. Then we would need 
to develop a pattern for modifying the current operation pattern. This 
analysis and correction would be performed while the system performed 
its routine operation. When the higher-order considerations reached a 
decision to alter the present program, this change must be introduced 
by a swift and stable procedure. 

In the present example, suppose, we store the inputs ( e.g., 1, 2, 3, 1, 2, 3 ), 
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search the program and previously stored data, and create the fol- 
lowing rule: “If the incoming data pattern is 1, 2, 3, , change the 
contents of box Af to read ‘Add F to G’ and also change the contents of 
box L to read ‘Add E to GV* This change would give an output 13, 12, 5, 
13, 12, 5, ... , and result in the desired average of 10.00. (As an 
exercise the reader may try to devise other modifications of the program 
to achieve the same result. Note also that other modifications might be 
desirable if the input sequence changes, say to 1123311233. . . .) 


Learning and Memory Reorganization 

A system that can modify itself such as the one cited above, is said to learn . 
To some extent the designer can build learning into his system by providing 
for higher level supervisory procedures or routines.* However, it is also 
possible to construct machines that exhibit self-learning, or self-adaptation. 
( See Project 4 at the end of this chapter. ) 

To continue with the same example, suppose the designer has provided 
a modicabon routine that makes up rules, such as the one given for main- 
bmrng the system output at 10, for different tnput sequencer At some time 
such as 1, 2, 1, 3, 2, 1 or 112233112233, 
?' a repeating itself statistically so that to the 
g ran the Is, 2s, and 3s appear with the same frequency). Then, the 
s> stem may End that the same modification rule (used in the previous 

2 1 r? • ,or uf « * »****“ SS* 
if- ,h ' !e!ore coDc,ude ^ a ^ 

“cl«Sn w^f o^r^ fte ,'t urac y> 3 simplification in rale 

tion rule can be ntty eventua5! y learn that the modifies- 

3. in equal *• *• ^ 
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information for new; (2) a change in the interconnection of patterns; or (3) 
a change in the values placed on data patterns at a given time. 

Learning can take place not only by the acquisition of new information, 
but also by the reorganization of present information, or by a change in 
the interconnection and value of operating patterns as experience grows. 
Moreover, if memory and processing resources are limited, as they usually 
are, the process of learning introduces problems of resource allocation, 
priority of change or job performance, and innovating new procedures and, 
possibly, goal changes. 

What new information shall we consider? Where and how shall it be 
stored? What will it replace? How will the new information change our 
rules of operation? The priority of changes? The immediate jobs? Do we 
have sufficient memory, or the right memory content and organization, to 
perform the job? Do we have any other subjects we should be considering 
now, but are not? Do we have the right balance between data gathering, 
evaluation, learning, decision, and action? Is the job we are doing now 
worthwhile, or should we consider other objectives and goals? An entirely 
different approach to problem solution? 


A SYSTEM HIERARCHY 

We will discuss some procedures for answering these questions in later 
sections, but the present observations are sufficient to suggest a hierarchy 
of systems, arranged by the “order” of feedback and organized memory 
employed. 

The Simple Machine or Transformation 

The so-called open-end system or simple operation is of “zero” order, be- 
cause it has no memory and no feedback. 

The Simple Machine with Feedback 

This is the first-order system because direct feedback is present for control. 
No selective memory is present. 

The System with Conditional Selection of Plans 
and Predictive Feedback 

Here error correction is based on extensive memory facilities, careful 
memory organization, and the ability to evaluate and act on a wide range 
of different input conditions, often by predicting the requirements for im- 
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mediate actions based on Suture needs. We may distinguish a range of sub- 
classifications for this type of system, by the sophistication of the prediction, 
plan selection, and plan variety. This so-called tactical system, uses second- 
order feedback 


The System that Learns 

At this level, the system not only performs the functions above, but it can 
also develop new plans, new decision rules, or new predictions, or change 
the value of plans and methods to handle new conditions. The learning 
system may be thought of as a higher-order system, because it must Icam 
while performing its lower-order functions. The corrective selection and 
development of plans may be thought of as a third-order feedback process. 
The system must perform a number of functions at once; The immediate 
control of receptor and effector functions, the tactical selection of plans, and 
the development and evaluation of new groups of plans, or strategies. This 
mode requires a larger mcmoiy, a richer interconnection of patterns of 
operation, and a more complex memory organization. Feedback can take 
place for the immediate control of receptor and effector, the selection of 
present stored plans, and in the development of new plans by reference to 
the system goal, hence, the term third order. This is the ‘'strategic’* system. 


The Goal-Changing System 

If a system can learn and, as a result of what it has learned, “consciously* 
develop, select, and implement new and improved goals, it has clearly 
reached a higher level of autonomy. We call this “fourth-order* feedback, 
which corrects methods of learning, develops new problems to solve, in- 
novates, and controls goal-changing itself. Such a system must also have 
extensive memory facilities and be able to monitor and evaluate each of the 
“lower leveF processes and control their change. The system must also 
have an increased “awareness' of data and patterns, both internal and ex- 
ternal to the present needs of the system. The result will be increased 
richness of memory and memory interconnections. We may again distinguish 
a range of autonomy within this category: The system that can develop, 
select and implement its own goals (and integrate the corresponding strate- 
&es, tactics, and lower operations) would be considered more autonomous 

“ Uld ° nly 561601 and ™pkment a range of goals 
given to it by the designer. b b 

to the 0,d “ ° f l ystenl hierarch >' described here, it is useful 

to think of . comparison, such as that shown in Figure 5-7, 

at cSereS'r “ mlU,ary “BMiaaUon U a composite of systems 

different levels, usualfy arranged in a hierarchy of autonomy as suggested 
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by our classification. (As an exercise, the reader may try to make a similar 
classification of computer system functions in a ranking of autonomy similar 
to Figure 5-7.) 


Functionaries resporulbla 

System 

order Industrial organization Military organization 


0 

Production workers and salesmen 

Troops 

1 

Foremen and supervisors 

Sergeants 

2 

Executi\ e Vice President 

Company Commander 

3 

President and Staff 

Ceneral and Staff 

4 

Board of Directors 

General Staff 


5-7 A comparison of the functionaries responsible for administration of systems of 
various levels of autonomy. 


THE USE OF SYSTEM CLASSI FI CATION 

A knowledge of a system's "lever of operation, in the sense just defined, is 
useful to the analyst and designer in several ways. 

For the analyst, knowledge of the system level aids in the selection of 
diagnostic tools and procedures, indicates the need for possible simplifica- 
tion measures, and suggests questions that should be asked in determining 
system performance, or the location of system faults. The analyst must be 
aware of the variety and structure of the system he confronts, and the early 
classification of system level alerts him to such facts. 

VVe know, for example, that a zero- or first-order system (a simple- 
machine) can be approached on a mechanistic input-output basis of analyses 
for many puiposes, but that we must know about internal memory struc- 
ture before we make predictions concerning higher-order systems. We know 
that the more autonomous the system and the higher its level, the more we 
should ask about goal and value information, how memory is changed with 
time, and what memory now contains due to past experiences and learning, 
if we want to make reasonable guesses about the system’s future actions in 
response to input.* 


* An interesting example is the prediction of Supreme Court Decisions by an analysis 
of oersonal histories from childhood of each of the Justices. Such an approach was 
...JJ bv the Research Corporation of America under Leo Cherries direction to produce 
a correct prediction (contrary to popular belief) on the decision establishing the con- 
stitutionality of the Taft-Hartley Law. 
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The designer, on the other hand, may wish to improve the operation of 
his system by raising it to a higher level. Again, blowing even in rough 
form what must be accomplished or included in the system o ac iev 
this end is highly useful in planning design strategy, the change of designs, 

or in setting design goals. . l 

We know, for example, that giving the system some memory, even th°ug 
small will greatly enhance its ability to produce a variety of outputs an 
usually improve its control ability.' We also know that improvement of 
the higher-order system is often most effectively attained by improving its 
higher-order learning, memory, and goal formulation processes, rather than 
by dealing with improvements at lower-order levels. 

We conclude that an awareness of differences between system levels— 
and the steps necessary to make the transition from one level to another— 
can aid the analyst or designer to lift his own performance to higher levels. 


PROBLEMS 


51 In collecting a number of case histones of automation in industry, J- R- Bright 
(in Automation and Management, Division of Research, Graduate School of 
Business Administration, Harvard University, Boston, 1958) suggested 1? 
categones of the automation type listed below. These categories, which 
indicate increasing system autonomy, were used to classify the data collected. 
Comment on these categones in relation to the material in this chapter. 

1. Hand 

2. Hand tool 

3 Powered hand tool 

4. Power tool hand control 

5. Power tool, fixed cycle (single function) 

8. Power tool program control (sequence of fixed functions) 

7. Pow er tool system, remote control 

8. Actuated by introduction of work piece or material 

9. Measures characteristics of work 

10. Signals preselected values of measurement (error detection) 

11. Records performance 

12 Changes speed, position, direction according to measurement signal 

13. Segregates or rejects according to measurement 

14. Identifies and selects appropriate set of actions 

15. Corrects performance after operating 

18 . Corrects performance while operating 

17. Anticipates action required, and adjusts to provide it 


'ST * tsmputer et UX8-42 ooold oaly 23 deemul dial,. 1 
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PROJECT 4 

{(The following experiment is adapted from an article by II D Block, “Learning 
in Some Simple Xon-Biobgical Systems," American Scientist , 53, No 1, March, 
1965, pp. 59-79, where additional experiments of the same type can be found ) 
Because this experiment can be completed with easily available equipment 
-four paper cups and slips of paper or a blackboard-the reader should try it on 
his own to convince himself of the Teaming like" behavior of the system } 
Consider two Players, A and B Before them is a board, as shown below. 


a 

a 

D 

a 

a 

a 

a 

m 

m 

m 

a 

a 

si 

EI 

El 

EI 

EI 

El 

EI 

EI 





a 

a 

a 

a 

a 

a 

a 

a 

a 

m 

m 
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The additional equipment needed consists of a pair of dice. Player A rolls 
the dice and generates an outcome from 2 tlirougli 12. The number on this roll 
indicates the number of chips or markers that will be placed on the playing 
board. For example, in the picture above, Play er A has rolled an “8" and placed 
8 chips on the board, starting from left to nght as shown. Player B moves next 
He may remove 1, 2, or 3 markers (removing them from right to left) from the 
board. Player B must remove at least one chip or marker, but he cannot remove 
more than three. 

Player A moves again, now taking 1, 2, or 3 markers away This process 
continues with Player A and Player B alternately removing markers from the 
board. The player who talcs the last marker from the board loses the game. The 
game thus has the name, "Last One Loses.” 

If this game is repeated a number of times, we can plot the results m a 
cumulative curve. The plot moves up one unit if Player B wins, and down one 
unit if Player A wins. 



- Player A score 


Project 4-A 

Such a curve of results is in the form of a Teaming curve" for Player B. 
Although Player A can be a thinking human being, we can create a machine to 
think for Player B which over a period of time will produce a learning curve for 
Player B, who operates mechanically without thinking. 
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(The article utcd as reference for this experiment contains an extensive 
bibliography on machine learning. Read the original article ami a s_elcction of 
these references, such as W. G. Walter, "A Machine that Learns, Scientific 
American, 185, 2, August, 1951, pp. CO-63 and H. A. Simon, "llow Computers 
Can Leam from Experience," in W. F Freibeiger and W. Pragcr (cos.), Appit- 
cations o} Digital Computers, Ginn, Boston, 1963 A classic article by A. M. 
Tunng, “Computing Machines and Intelligence," is available in two sources, 
originally in Mmd, 1950. and repnnted as “Can a Machine Think?" in J. R. New- 
man, The World of Mathematics, Simon and Schuster, New York, 1958, Vol. 4, 
pp. 2099-2123.] 


Study a machine which simulates behavior W. G. Walter, in his classic The 
Living Brain, Norton, New York, 1953, has provided a popular study of brain 
waves and related subjects However, this book also contains many interesting 
sidelights and diversions which will be of interest to the reader of this chapter, 
in particular a discussion, in Chapters 5 and 6, of learning and machines, that 
exhibit animal-like behavior ("exploration, curiosity, free-will in the sense of un- 
predictability, goal-seeking, self-regulation, avoidance of dilemmas, foresight, 
memory, learning, forgetting, association of ideas, form recognition, and the 
elements of social accommodation") The subject used for description is a small 
mechanical animal consisting of two receptors (a photo-cell and a switch), 
two effectors (two motors), and two "brain cells" (vacuum tubes), which Walter 
has named Machine spcctdafrfx to indicate its searching ability. The construction 
details of this machine can be found in Walter's Appendix A, and a description 
of its behavior, with photograph, can be found in Chapter 5. 

Obtain a copy of this book, and read the cited chapters and appendix. Do 
you think this machine shows life-like qualities? Why is this macliine different 
from the programed electronic computer? In what way does it show the kind 
of behavior qieeified by Walter as human behavior? Why is the search and scan 
behavior? ** ^ * ™ p0Ttxnt lo lts °P«aUon and simulation of animal 
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VARIETY AND CONTROL: 
THE LAW OF 
REQUISITE VARIETY 


We have discussed the building blocks and procedures of systems definition, 
and also the range of resulting system types, arranged in a hierarchy of 
ever-increasing autonomy. We have seen that the more autonomous system 
can take a greater variety of actions in response to input than a lower level 
system. Moreover, we have discussed the system from two viewpoints; 
The "outside” view of the analyst, who hopes to predict, analyze and, in 
short, control the operation of the system; and the "inside” view of the 
designer, who hopes to make the system more versatile. 

Intuitively, at least, we should now realize that as a system becomes 
more autonomous, the analyst’s outside job becomes more difficult: As the 
system grows in size, complexity, and variety, the analyst has more difficulty 
understanding and controlling it In a sense, the analyst is in competition 
with the systems designer. The analyst hopes to control the observed 
system; the designer hopes to create a system that cannot be controlled 
against its will, but that can control other systems. The analyst faces this 
problem when confronting a system in nature, which may not be in direct 
competition with him. The more complex the phenomena, the more skilled 
and resourceful the analyst must be. 

A formalized rule (illustrated in Figure 6-1) having many strategic 
implications for the analyst is embodied in Ashby’s Law of Requisite 
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Variety and Control: The Lew ol Requisite Variety 


[ 


1 *1 

INTERNAL 

EXTERNAL j 1 

DESIGNER OR 

ENVIRONMENT j 

[ OPERATOR 

OR COMPETITOR 

OF A SYSTEM 

(JONES) U — 

| (SMITH) 


j COUNTER- 
‘ MEASURES 


S-1 Requisite variety In a competition. It the analyst hopes to control his system in 
the tace ot competition or a changing environment, he must be able to take 83 many 
distinct actions, or countermeasure;, as those available to the enemy. It the competi- 
tor can take A distinct actions, and the analyst-controller can take B distinct counter- 
actions, the ratio A/S must be less than or equal to 1 for the analyst to have as- 
sured control. In the examples to follow, we shall call the competitive system “Jones" 
and the analyst-controller “Smith," as shown above. 


Variety,’ so named after its author. Essentially, Ashby’s law states that, 
for an analyst to gain control over a system, he must be able to take at least 
as many distinct actions, Le., as great a variety of countermeasures, as the 
observed system can exhibit. In this chapter we will examine some implica- 
tions of Ashby’s law. 


WHAT IS VARIETY FOR THE ANALYST? 

From previous chapters we recall that the definition of a system is an 
arbitrary selection of variables and relationships, according to the analyst’s 
purpose. In short, the term system does not refer to “that thing over there,** 
which may be described by an infinite number of systems, but to a particu- 
lar list or collection of variables and relationships selected by the analyst 
for a particular purpose. 

How can the analyst increase the list, which defines his system? How 
does he End the number that measures variety? 

The length of the list, which defines the complexity of the system, 
depends upon the number or variety of distinctions the analyst (or con- 
troller) must make satisfactorily in order to act or to exert control. If the 
analyst must consider ten independent variables rather than two he will 
f * e fon f r j ° b more com P Icx tkm the latter. The analyst 
must therefore count the number of distinctions that are important to him, 

that this chapter York * 1963 • 11116 reader wifi note 

material in jUSt c ? vwed * Charter 5 and to the 

Sequences of the information theoryof C. Shannon ^ €Xpt * snng thc P racticaI **»*' 
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It is now necessary to clarify several definitions. Let the system have r 
distinct possible actions (A = 1 , 2 , 3 , . . . ,r) that arc important to the analyst, 
and let the analyst have c distinct possible actions that may be taken in 
response (B = 1 , 2 , 3, ...» c). These systems and analyst actions may be 
listed in tabular form, as in Figure 6-2. Note that this procedure com- 
pletely defines the variety of the analyst and the variety of the system, 
and shows clearly the relative variety. The rows in Figure 6-2 are the system 
variety; the columns are the analysts variety. 

Now suppose the observed system takes an action (selects a row), and 
the analyst takes an action (selects a column). Then let the outcome of 
this composite choice be shown at the intersection of the selected row and 
column in the table. This outcome may be thought of as a payoff in a 
gambling game. 

Example: Jones has a headache, a system action. Physician Smith is 
treating Jones. After observing Jones, Dr. Smith administers aspirin, a 
distinct action available to him. Let us say the outcome of this treat- 
ment is that Jones is relieved. Other system actions, treatments, and 
outcomes were also possible. Suppose Dr. Smith administered arsenic 
for the headache. The outcome of treatment would be quite different. 

As in the example, we may describe for each combination of system 
action and analyst reaction an outcome, which may be named, or measured, 
and placed in the outcome table. 


POSSIBLE 

COMPETITOR 

ACTIONS 

(JONES) 



6-2 The format Of the outcome table. 



Variety aM f Control. The Law of Requisite Variety 


m 

COMPLEXITY 

We must now observe that the “complexity” of a system is not measured 
by physical sac, but by the distinctions the analyst must take, or chooses to 
make, In other words, our measure of variety measures complexity. 

Example. An industrial organization of 100 workers, such as cottage 
weavers, who work independently on identical jobs, does not present 
significant control problems. If one weaver falls ill, the organization 
does not materially suffer. Indeed, the organization can easily be ex- 
panded or reduced in size. One directive will regulate all the workers. 
For the analyst, the total operation can be reduced to the operation of 
one weaver, multiplied 100 times. 

However, when ten machines operate with division of labor and a se- 
quential flow of work with, let us suppose, ten different products that re- 
quire varying sequences of demands and technological requirements, the 
number of possible sequences is ( 10! ) "\ clearly beyond complete control 
of management. 

Complexity, then, refers to the variety the analyst must encounter to 
obtain “adequate” control. Tlius, to use Ashby’s example, a cattle breeder 
may not be able to obtain ultimate control over the exact combinations of 
genes (which determine heredity properties) m bis stock. For the breeders 
purposes, however, this may not be necessary. The breeder need make 
only those distinctions required for his success. He need only control pair- 
ings that will increase weight, marbeling, fertility, etc. He may be uncon- 
cerned With the length of the steer’s tail. 


REQUISITE VARIETY 

The analyst, or controller, then, defines his system according to the number 
ol distinct actions that arc important to him. It tho analyst’s resumces 
are limited (as they usually arc), he will attempt to limit the variety of 
distinctions to the minimum number, necessary (or success in the particular 
problem. 

If such a -minimum list" procedure is followed in defining the system, 
uliat is meant by requisite variety? To answer this question, ivc will now 
consider a senes o! gambling csamplcs in which Jones represents the system 
and Smith the analyst-controller. 

£r<mt,.fc To generalize, suppose Jones and Smith play the SolWIng 
matduog game. Jones (the system) moves first and places on the table 
l* 1 * l T “’W' '“Smith (analyst-controller). Assume Smith then 
plates an oh,ect ol hi, choice on the table. Jones pays Smith SI i( the 
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objects match, but Smith pays Jones $1 if the objects, for any reason, 
do not match. Now, as a first example, give Smith and Jones each one 
coin, and let either coin show heads or tails as they may. Then Smith 
will always win , because he can always match Jones. Smith has the 
requisite variety of actions, Le., the required two moves, which can 
counter the two possibilities offered by Jones. Only one outcome is 
possible in this game (assuming Smith's vision is good and he plays 
correctly!). That outcome is "Smith wins," the outcome “a” in Figure 6-3. 

Requisite variety, then, is the minimum number of distinctions or actions 
the analyst or controller must have to malce the outcome of such a contest 
certain. If requisite variety is measured in this way, Jones can win only by 
increasing the variety of his actions compared to Smith. 

Example : To illustrate, let Jones have not only one coin to show, but 
also one six-sided die. Jones may now show H or T, or one of the die 
sides, 1, 2, 3, .... 6. He has eight possible actions. If Smith still must 
match Jones to win, and if Smith only has one coin (but not a die to 
show), it is easy to see that Jones now controls the game, even though 
he goes first. Even if Jones plays at random, he will win £ of the time, 
i.e., when he shows a die. If Jones "1631715" of Smith’s predicament, Jones 
can always win (by always showing his die). Thus, Smith has lost 
control of the game. He cannot guarantee the outcome. The previous 
example’s outcome is reversed, because the available variety ratio now 
favors the system (Jones). See Figure 6-4. 

We should clarify another point before proceeding. Note that the reverse 
situation, i.e., giving Smith a greater variety of possible actions than Jones, 

Is not sufficient to guarantee that Smith can always win. (See Figure 6-5.) 
Smith must not only have at least as many possible actions as Jones, but he 
must also have precisely the correct ones in his list of possibilities. Thus, 
the required number of alternatives is a necessary condition to exert com- 


SMiTH 



6-3 An example of the outcome table for a coin game. 



,40 V«Mt ana Cant'd- Um cl MM V«M* 
SMITH 



6-4 A disadvantageous matching game lor Smith 

plete control. The inclusion (within that number) of the precisely correct 
alternatives will then provide the sufficient conditions for Smith to 'Vin 
always. 

The result of this discussion may now be extended to the case of many 
outcomes. 

Example: Let us suppose that a different outcome can occur for each 
combination of play that Jones and Smith make in an extended match- 
ing game, defined as follows. Jones will select a number between 
1 and 9 and then Smith, knowing Jones' selection, can select a letter 
A, B, or C. The resulting outcome, or payoff, is indicated by the lower- 
case letter in Figure 6-8. Note that no outcome is repeated in the same 
column, so that to obtain a given outcome as Jones moves, Smith 
must move. 

Now suppose that, regardless of the particular outcome, Smith 
changes his objective and now wishes to minimize the variety of possible 
outcomes Because Smith is completely familiar with the payoff possi- 
bilities, be can determine, in advance, his response for each move Jones 
makes, e.g., 

If Jones, 1 2 3 ... 9 

Then Smith, A B C ... A 

Gives outcome a a a ... r 

It is now possible to state that the minimum number of possible out- 
comes cannot be less than 
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6-5 Another disadvantageous matching game tor Smith. As In Figure 6-4, Jones has 
a coin and a die, so that he may display eight distinct outcomes. Jones also has eight 
distinct outcomes, provided by his coin, and, say. the Ace, King, Queen, Jack, 10, and 
9 from a deck of playing cards. If the game requires exact matching of objects and 
attributes, It Js apparent that Smith can win (match exactly) In only two cases out of 
tho 64 possible pairs. Although Smith has the requisite variety to match Jones, Smith 
does not have the correct distinctions required for matching The Important point, 
however, Is that if Smith had less than eight possible distinctions, he could surely be 
defeated by Jones. With the requisite number of distinctions, Smith has a chance — If 
he has the correct distinctions In his set of possibilities. 


( The number of choices Jones can make ) 

(The number of choices Smith can make ) 

where the contestant making the second selection appears in the 
denominator; i.e., the minimum variety of outcomes in the contest is 
equal to the ratio of the variety available to the two contestants, 2 in this 
example -| = 3. 

We could easily prove this result In Figure 6-6 by selecting a column 
for each row that will duplicate successive outcomes, thereby minimiz- 
ing the variety of outcomes. Suppose outcome a is chosen in the first 
trial. We would then select column I for row 1, column 2 for row 2, and 
column 3 for row 3 (for the outcome table given). But at row 4 we 
must introduce a new outcome, because, by our assumption, no outcome 
can appear more than once in a given column. The minimum number 


* The ratio must be ^ 1 for Smith to gain complete control. 
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SMITH 



6-6 A generalized matching game with many distinct outcomes. This game Is the 
same as belore H Jones selects a row and Smith a column, the outcome indicated 
by the letter at that row-column Intersection will occur. However, now there are vari- 
ous kinds of outcomes, not Just "win" or "lose ” This table is used in the discussion 
o) the text to derive the Law oi Requisite Variety 

of outcomes follows from a continued application of this procedure. So, 
for a table with r row's and c columns, and with no two outcomes in the 
same column equal, the variety of the selected set of outcomes cannot 
be less than r/c. 1 

By reference to the outcome table of the above example, Figure 6-6, 
we are now prepared to state Ashby’s Law of Requisite Variety formally: 

The variety of outcomes, if minimal (in the sense of the precious ex- 
ample), can be decreased further only by a corresponding increase in the 
number of columns of the outcome table. 

Example: In the outcome table. Figure 6-6, Jones has nine choices and 
Smith has only three. Suppose the outcomes shown relate to various 
forms of Smith’s health after an encounter with a Jones action. Suppose 

’ If two or more identical outcomes are allowed per column, then the variety of outcome* 
can he less than the number of rows, even though no column selection Is made in 
response to a row choice. la this case, the minimum number of outcomes can be less 
than the ratio given above. (Consider the table in which all outcomes are a, 
regardless of the row and column selection.) Moreover, if the outcomes In the table 
are afi nsBeient, the minimum number of outcomes will be greater than the given 
ratio, Le., as many as r X c, depending upon the table entries. See Problems 6-2 and 6-3. 
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also that only if outcome n occurs can Smith survive; c.g., all other 
outcomes represent death. Can Smith guarantee his survival? He cannot. 
Why is this so? * 

For a variety of input conditions (Jones), only an increased variety of 
alternatives for the controller (Smith) can modify the variety of output 
(outcomes). 

Only if the variety of the regulator is at least numerically equal to, 
or greater than, the variety of the system to be controlled, can control 
be a certainty. 

Example: Reverting to the matching game (die-coin versus die) of 
Jones and Smith, we can cause Smith to win at each trial only by giving 
him the additional alternative of showing a die as well as a coin. Remem- 
ber again that an increase in Smiths variety would not necessarily allow 
him to win all the time. If we let Smith play with a roulette wheel (or 
some other object) instead of with the required die, it would not help. 
The point is that, to reduce the outcomes to win alone, we must at least 
increase the number of Smith’s alternatives, otherwise no improvement 
is possible. It is essential that Smith be given a numerically increased 
set of alternatives. 

This is the first meaning of Ashby's law, which gives a necessary 
condition for control. The sufficient condition, which may be confirmed by 
consideration of Figure 6-6, is that Smith’s set of alternatives contains at 
lease one “successful" outcome for each Jones action, or row. Thus, if a is 
the only successful outcome allowed, then the Smith alternatives must pro- 
vide at least one a in each row of Figure 6-6. (If only one o is allowed per 
column, then at least as many columns as rows will be required, as we saw.) 
Thus, the minimal set of Smith alternatives to be successful must also show 
a winning outcome for each Jones move. This is the second, sufficient 
condition implied by Ashby’s law. 


REQUISITE VARIETY AND ITS CONSEQUENCES 

Ashby’s law tells us that control cannot be certain unless the controller has 
at least as many alternatives as his defined system can exhibit. This gives 
us a numerical lower bound on the requisite variety of actions the controller 
must have to be effective, a measure of the minimum requirement. 

The controller must often have many more alternatives, in order that he 
have in reserve the appropriate set to use in a given problem area, but the 
controller can never have less than the requisite number. 

* Because, in general, e must be at least ^ r for Smith to gain complete control. 



1 44 Varte fy end Confro/: Tft* Law of Requisite Variety 


The derivation of Ashby's law is a property of the outcome table, and 
not of any particular real-life situation; it is not subject to the properties 
of any particular machine or system. Rather we should say that any machine 
or system defined by the analyst 3nd reduced to an outcome table must 
inexorably obey the demands of the tabular geometry. To make the tabular 
presentation meaningful (see Problem 6-2) the analyst must be able to 
list all of the system actions that arc important to him, but this is precisely 
what we mean by defining the system. 

Although relatively simple in concept, Ashby’s law is an elegant and 
powerful tool for the system analyst and designer for the following reasons; 

1. It establishes an upper limit to the amount of control that can be 
exerted in a given case, or conversely a minimum requirement that the con- 
troller must meet 

2. It tells U! there are only two major ways to adjust the ability of the 
analyst (or controller) to the requirements ol the thing to be controlled: 
(1) Increase the variety of the controller, or (2) reduce the variety of the 
system to be controlled. 


Eiatnple: The biological organism Is able to counter changes in the 
environment, such as moderate changes in humidity, temperature, and 
oxygen content of the air but it cannot handle all possibilities. To sur- 
do not r ?" iCt lhc ranse °' its "riranment. Tims, sve 
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Physician is subject to Ashby's law. 
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Example: The traffic director of a large city does not attempt to regulate 
the precise movement of each vehicle. He works to control the general 
traffic flow by introducing restrictions (one-way streets, limited access 
roads) and by increasing the variety of his control actions (the com- 
puter control of traffic lights, based on a variety of area traffic flows). 
He cannot do otherwise. 


THE RATE OF VARIETY 

We will now consider how the rate at which variety can be handled affects 
the systems analyst. We will use an informal approach in this section but 
we will present the subject more formally in the next section. 

Example: We turn again to the die-coin versus coin game of Jones and 
Smith. Let us suppose that Jones can display one of his eight choices 
(H, T, 1, 2, 3, .... 6) once each minute. Then, the number of different 
choices Jones could make per minute, his rate of choice, is eight, and 
this combination of the number of distinctions possible per play and 
the rate of play gives us one measure of the rate of variety Jones can 
produce. 

As we have seen in Figure 6-3, Smith can select, say, only H or T; 
suppose he also can make one play per minute, with, of course, two 
distinct choices. Smith’s rate of variety as measured above would be two 
per minute. If Jones, as the first player, can only make two choices 
per minute, then Smith will always win, as before. He has the requisite 
rate of variety to do so. 

If we compare this situation to the “ one shot” example of Figure 
6-3, we have the feeling that the Law of Requisite Variety should hold, 
if we simply replace the word “variety” with the term “rate of variety.” 
Our intuition here is correct. 

Example: To continue with Jones and Smith, let us now give both Jones 
and Smith a coin and a die (so that in the “one-shot” game Smith will 
always win if Jones goes first). Now change the rules of the game 
slightly as follows: Jones goes first, and a record is made of the 
sequence of his choices, e.g., HH63T12T .... for 60 minutes, so there 
will be a string of 60 characters available at the end of the hour (due 
to Jones). Because he plays after Jones in full knowledge of what Jones 
has done. Smith may play as Jones chooses or at any time thereafter, 
provided the record of choice is preserved. Now, Smith, either at the 
time of play or later, is allowed to see Jones’ choice record, and is re- 
quired to match the sequence of choices to win. Smith will be paid $1 
for each character (Jones’ choice) matched as before, but Smith must 
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BW Jones SI for each character Ml matched ana reason, “ bcr “ r =- 
1 II Smith can also male one choice per minute, and a also allowed to 
phu one hour, it should be clear that Smith will be S80 nehcr when he 
quits. It is the same as ii Jones and Smith played 60 one-shot games in 
succession. 


Example; It should also be equally clear that if Smith can make only 
one choice every' two minutes (30 per hour), Smith cannot duplicate the 
total exact sequence of choice Jones has made, but only half of them 
(and thus would win nothing). Similarly, if Smith can make only one 
choice per hour. Smith can duplicate in one hour only the Jones' first 
choice, and, by the rules will pay out $59. 

The analogy between the controller (Smith) and the system (Jones) 
is exact. The controller must exhibit at least as great a rate of variety 
as the system to be controlled, for the outcome of control to be certain. 

An extension of this idea, which may not be obvious, comes into 
play when the controller and the system are allowed to work on the 
basis of different time periods. 


Example: Suppose in the preceedmg example Jones (working at one 
choice per minute) makes his 60 choices in the one hour alloted to 
him, and quits. But, suppose also that Smith (working at 0.5 choices 
per minute) is given two hours to complete his task. Then Smith can 
duplicate all of Jones’ choices exactly, and Smith can again completely 
control the situation. 

This result does not alter the "rate interpretation" of the Law of 
Requisite Variety, it merely reminds us of the exact definition of "rate." 
By extending his variety over a longer time, Jones’ rate can be cut 
effectively to Smith’s level. Indeed, it may be more illuminating to 
state that the choices available to the controller must be at least equal 
to the possible choices exhibited by the system in a given time period 
for control to be certain. But, this is just a different way of describing 
the ratio requirement for rates of variety. 


REQUISITE RATE OF CODED VARIETY 

Consider the situation that allows the controller to work (Mr, than the 
system, i.e.„ the situation in which Smith makes choices faster than Jones. 
Example: Let ns, as in Figure 6-4, 6 ive Jones a coin and a die, so that he 
ha, eight Asttnctehote (H.T.1& 3, . ...6), and allow- Jones to make 
. 5 « minute, or 60 per hour. Now suppose Smith has a 

wtth two choices (H or T). as Wore, bn, , „ o^vo him 

the following ability: He may nse three choices of his coin, e.£ IITII, 
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and it will be understood that such a unique selection will be trans- 
lated by Figure 6-7 to be equivalent to the Jones choices. 


HHH-H 
THH — 1 
HTH - 2 
HHT — 3 
HTT - 4 
THT - 5 
TTH - 6 
TTT -T 


6-7 

Thus, if Jones shows a “1,” we will say Smith wins if he shows the 
unique equivalent combination, or code, ‘THH,” which, however, repre- 
sents three selections for Smith. Now, if Smith can make selections of 
H or T three times as fast as Jones can select (H,T,1,2,3, ,6), and if 

the translation in Figure 6-7 is allowed, then Smith can always win, even 
though his range of choices is limited to two compared with eight for 
Jones. 

This example is of practical importance. We find that by working faster. 
Smith, who has only two choices, generates as many different outcome 
sequences as Jones, who is working more slowly with eight choices; Smith 
thereby builds the requisite rate of variety. Given adequate time, or a 
sufficiently fast rate, the controller can always generate the requisite amount 
of variety— if the above translation ability is permitted. In the information 
theory terms of Chapters 4 and 5, we can illustrate this point formally by 
stating the equivalence in terms of the equivalent number of "binary ques- 
tions” that each choice would require. Thus, with eight distinct choices, each 
choice of Jones contains log 2 8 = 3 “bits” of information. With only two 
choices, each of Smith’s selections contains only 1 log 2 2=1 “bit.” The 
variety of possible distinctions will be equal; i.e., Smith can generate as 
much “information” as Jones if Smith can make three choices while Jones 
makes one, because 3 log 2 2 = log-. 8 = 3. 

We may now restate the Law of Requisite Variety for the rate of variety 
in coded form: For the outcome of control to be certain, the controller must 
generate at least as many bits of information per unit time as does the 
system to be controlled. 
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MEHOHY CAPACITY ANO 5PEE0 OF PROCESSING 

We have seen tot to obtain complete control rf a system tte analyst or 
controller must have three abilities. 

1 At least as many distinct available alternatives as the system can 

’ eshibit (or equivalently, the ability towotk longer ot faster with coded 

equivalents V. . , , . 

2. The precisely correct set of alternatives within the set available to 
counter those generated by the system (or equivalently the precisely 
correct translating ability); 

3. The processing ability to use these distinct actions {or their coded 
equivalents) at a rate at least equal to the system to be controlled (or 
such that the information generated by the controller per unit time 
equals that generated by the system). 

The controller can now handle a wider range of considerations or "sys- 
tem oppositions" if he js given both an increased reserve of possible actions 
and an increased processing ability, in terms of speed and memory organiza- 
tion This is the same result we developed when we discussed system 
autonomy in the last chapter. 

Exomple: Taking a final look at Jones and Smith, suppose we give 
Smith, the controller, a vast array of objects to use in his matching 
games-dice, coins, cards, roulette wheels, and similar objects fa* pro- 
fusion-and so organize this inventory of objects that they may be stored 
and retrieved in a few millionths of a second. Further let us provide 
Smith with the ability to distinguish one class of stimuli from another, 
e,g , coin observations from die observations, and a program of instruc- 
tions, so that upon seeing what Jones does, Smith may retrieve the 
correct object and present it in its coned matching state, Let Jones 
have a coin and a die as before, which he may show at one choice per 
minute. 

Smith can now take on Jones with absolute certainty of success. And 
in addition. Smith can take on many other gamblers like Jones, who 
may be allowed to play with ah objects m Smith's collection, at the 
slower rate. 9 Within the limits set by Ashby’s law, Smith can defeat all 
opponents. He is in control because of bis alternate reserve resources, 
his ability to organize and integrate his action, and his speed of process- 
es- Consequently, Smith can exploit Ashby's Law of Requisite Variety 
by using his speed, his reserve of actions, and his organized patterns of 
operation to counter the environment and the other gamblers system. 
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We may recognize in the last example an analogy with a modem elec- 
tronic computer or information processing system which, by high speed 
and a large store of data and programs, is able to taclde a wide variety of 
problems with success. 


SYSTEM SIZE AND ASHBY'S LAW 

Finally, note that the derivation of Ashbys law is independent of the size 
of the outcome table, i.e., the law is not altered by the number of rows 
and columns that must be considered. Conceptually, Ashby’s law holds 
for the comparison of an analyst (or controller) of any complexity versus 
a system of any complexity. 

Moreover, we do not care how the outcomes shown in the outcome 
tabic come about. We do not care, for example, how Jones and Smith 
placed their objects on the table in the matching game: Jones may have 
informed his butler of his choice, and had the butler place an object on the 
table. Smith may have dropped his choice through a hole in the ceiling. 
Any other procedure would have been satisfactory from the viewpoint of 
this chapter: The procedures by which the choices produce the resulting 
outcome are irrelevant to what has been said here. This fact helps emphasize 
the generality of what has been reviewed. 

In short, to return to the purpose of this chapter, Ashby’s law tells the 
systems analyst that he ultimately may think of a system as a set of “If- 
Then” combinations and their corresponding outcome table, that this form 
of systems definition holds for any size system, or for any procedure used 
in obtaining the end result And, once the outcome table has been deter- 
mined, Ashby’s law prevails. To assure control, the analyst must have the 
requisite variety that can be measured. If he does not have the requisite 
variety, the analyst-controller must either simplify his system to obtain con- 
trol or increase his own variety of possible actions. Ashby’s law will guide 
us in the chapters that follow. 


'ASHBY'S LAW AND INFORMATION THEORY 

Before continuing, the advanced reader may wish to consult Appendix D, 
a brief technical discussion of information theory. In the foregoing discus- 
sion of the rate of requisite variety it was assumed that the choices made 
by the system (Jones) were random (all equally likely). Were this not so, 
the number of bits generated by Jones (called the source in Appendix D) 
would be less than the maximum possible, i.e., his actions would be re- 
dundant to a certain extent. To include this possibility a measure of source 
variety called entropy is defined in Appendix D. 
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to infonnation theory terns, Smith may be considered a 
is to handle intonation tom a source. J«* as we measured Snaths possible 
variety, the capaetty of the channel can also be measured in bus per unit 


With these terms in mind, is e can state Ashby s law to information theory 
tenns. The capacity of the channel in bits per unit time must be at least 
as great as the entropy of the source if messages are to be transmitted 
without error, assuming the absence of noise. (Noise reduces the effective 
capacity of the channel) 

Finally, the reader should note the importance of codes in matching 
the source to the channel. This was illustrated when we permitted Smith 
to use THH to match the Jones choice of 1, as in Figure 6-7. The code 
permitted us to match Smith's faster rate, yet limited number of outcomes, 
to Jones' slower rate and wider choice. In general, information theory tells 
\xs \hafc i£ she c&p&cfty a. e&usttl vs te she of a. source there 

is always some code which will provide such a perfect match between 
source and channel. 


PROBLEMS 

6 1 Compare the following procedures and comment on the benefits to be ob- 
tained using the second Method 1. Attempting to teach a student a series 
of numbers, you ask him to memorize the following list 

58121519222629 

which he is to recall on demand Record the accuracy with which the 
student can recall the list alter one week Method 2: The objective of the 
experiment « still the same, i.e , the student reproduces the above numbers. 
Ho\ve\er, in providing the list for memorization, some structure is now 
added 


5.8, 12, 15, 19. 22, 26, 29, . . . 

and the pattern of generation-alternate addition of 3 and 4 to the last 
number-45 also explained to the student. Again, ask the student to recall 
Ihe number, alter a week. Compare the effect of the method of patient 
presentation to that of detad presentation upon the accuracy of memory 
mention of a human heme Why » it easier to handle the pattern? to 
I'f" 0 '’"- " e »**> ° PWtem of relationships from 
Jt"** 1 observations Ho.v docs the definition process relate 
fatta TeT, ■ ta * ? (F “ '“•>« esperiments, see G. 

oSTcaS Sf^h 1 «» g . Studies In lfie ^jehob^ o/ Looming 
i eac ling, Columbia University Press, N« w York, 1940.) 
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6.2 Under normal conditions we have in slock ten distinct items to ship. Items 
1-5 are a deluxe version of the standard product, 6-10 Due to a temporary 
malfunction in the production department, items 6-10 are out of stock, but 
we have plenty of 1-5. A temporary decision is made to ship the deluxe 
version (at no increase in standard price), when the equivalent standard ver- 
sion is ordered. We now have a variety of only five actions to take to counter 
ten distinct disturbances, the incoming order types. 

A. Does this situation contradict Ashby’s Law of Requisite Variety? Explain 
in terms of the tabular presentation of Figure 6-6. 

B. Because of great response to our offer, we find we can increase produc- 
tion runs and sell the deluxe versions at standard prices in the future We 
therefore delete items 6-10 from the product line. Does this action alter 
the previous solution in terms of Ashby’s law? 

C. We now find that customers are indifferent to products 1 and 2 and 
3, 4, 5, i e., if they order either of the first two or any one of the latter they 
will be satisfied with any member of the given set How would you interpret 
this new result in terms of Ashby’s law? It appears that only 2 responses 
will satisfy 5 distinct stimuli. Is this a contradiction? 

6.3 In Figure 6-6, where there are nine rows and three columns, suppose Smith 
is indifferent to the outcomes a, r, l, and d. 

A. Does Smith have the requisite variety even though the rows exceed the 
columns in number? 

B. Suppose outcomes q, I, and s are all disastrous to Smith. Does he then 
have control of the situation? 

C. Would the knowledge that q, /, and z are all disastrous to Smith be of 
advantage to Jones (1) If he knew Smiths constraints of movements, i.e., 
the table of Figure 6-6? (2) If he did not have that knowledge? 

D. How would the knowledge that q, /, and z are disastrous outcomes for 
Smith be of greater benefit to Jones if he also found out that Smith could 
not use his alternative C? 

E. A knowledge of a competitors choice constraints and the joint outcome 
table provides a great competitive advantage. How can this be illustrated 
in Figure 6-6? 

F. Suppose in Figure 6-6 Smith must have outcome a occur to survive. 
How can he survive? 

6.4 Consider a system that is supposed to perform a regulatory function. This 
system, as shown in diagram form below, receives in a discrete series of 
intervals, a stimulus S, which may be one of three inputs, A, B, or C. The 
input S is transformed into a “successful" or an “unsuccessful” output E, 
depending upon the state of input S and the then current state of the 
regulator R, which may take on the states 1, 2, or 3 The conditional trans- 
formation determined by fl is shown in table T, in which a indicates success 
and b indicates failure. 
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fi-cholee 


1 2 3 



Let us assume further that the regulator R may be changed at each 
step of the input S, but tliat the clioioe of R's state, 1, 2, or 3, must be 
predicated on the last previous effect E observed. That is, any possible 
corrective action is delay ed until the next instance, so that the choice of R s 
state must be made without knowledge of what S will be. The only informa- 
tion available is the past history of the R choices and the corresponding 
results, or effects E. 

In this situation, which corresponds to most discrete industrial control 
situations, such as industrial quality control which monitors product vari- 
T . l eg 7^ b0n Can never be because of the one-step delay and 

the lack of knowledge of what causes a result. (The regulator “sees” only 
the output state o or b but does not know what input state caused this 
result-even though the transformation table T may be completely known.) 

We now wish to evaluate two policies for R in wliich a policy' will be 
defined as a sequence of R choices. We assume input B is most likely. 

innm regulator action 2 always {based on the fact that 

input disturbance B is most likely), or 

under tbp c 01 ^ al ran dom, if two consecutive b's occur, 

S* U S T ^ n *: b r«- .re equrdly 

f„ L Jt, ™ , T* <OT 1 3 > M >» >» output «. thi, policy tali 

! 2 Ue cyd« ItoberritJafter 

rtta. to Mate 2 accordrug to the raudom rule ,i hvo more bs occur. 

ATO^policy * !up , rio[ (t Mcomes Ien) ,( dtottaow _ or 
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This problem illustrates the "bit" 
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making the computations in this and the 
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A manufacturer has four customers: A, B, C, D. If the orders arrive at 
random (i.c., are independent, customer to customer): 

A. What is the entropy of the source of information, i e., how much in- 
formation is gained by knowing an order comes from a particular customer, 
if all customers are equally likely to send in an order? 

B. If the probabilities are, respectively, 0.4, 0.3, 0.2, and 0.1 for customers 
A, B, C, and D, what is the entropy of the source? 

NOTE: log* — 3-32 logio N; therefore, N log zN 

Oil -3.32 

0.2 -2.32 

0.3 -1.74 

0.4 -1.32 

C. If 100 orders are received during the week, what is the total amount of 

information received by discovering, as each order arrives, the customer 
from whom it came? 

D. Two hypothetical machines, Model I and Model II, are designed for 
display of customer credit standing. Both machines work with a magnetic 
belt formed into a loop sequencing the customers in the deterministic trans- 
formation 


A-*B-*C-*D 

The tape is advanced by pushing an “advance button" which moves the tape 
ahead one customer at a time. The current customer status is displayed on 
a cathode-ray television tube. In both machines, the operator indicates 
credit disapproval or approval by means of a keyboard, and in the latter 
case updates the receivables outstanding. However, the two machines differ 
as follows: In Model I the approval or disapproval of credit leaves the 
present customers record in view, whereas in Model II the picture is 
automatically reset to customer As display. Compare the two models by 
computing for this application the expected number of operations of the 
"advance button” per order. Assume the orders are handled as they arrive, 
i.e., are not presorted, and that each order must be checked on the machine. 

E. Why is Model II preferable in this application, and why is the number 
of pushes required of the “advance button” \ess in both cases than the 
entropy in bits computed in B? How could this machine be improved? 

6.6 In this hypothetical example, a CIA employee is to receive a coded message 
via the notes of a song which he receives over the regular Voice of America 
transmissions, 8.00 p.m. Monday nights, his time. The tunes are composed 
especially for the occasion and always start with the same note repeated 
three or more times, the first change indicating the start of the message. 
Rhythm is not part of the message. Seven preselected major tones (A-G), 
regardless of scale, are used in the compositions, with middle C and C 
above middle C being considered equivalent. The basic melody used in the 
code is usually obscured by extraneous background arrangements. A piano 
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i< used to transmit the message The coding of the message uses the transi- 
tion between a pair of notes to indicate a gisen alplubetic or numeric 
symbol, as shown in the proposed matrix of -19 cells (see below)- Thus, a 
transition Irorn D to C yields the decoded letter X and so on. However, of 
the 49 as affable cells, fen are not presently used 

A How could the unused spaces be met! to the greatest advantage to 
improve the secrecy of the messages 3 

B The composer now suggests also using the sliarps and Cats In sending 
messages, because this would permit a greater saricty of composition. Wltft 
principle should be followed in laying out the decoding table to maximize 
secrecy 5 

C The agents now decide to extend this coding/ decoding procedure by 
using scscral transformation tables of the type shown, but with different 
letter arrangements, thereby converting the single transformation to multiple 
transformations selected by an input The choice of table it to be by the 
name of the composer of the selection played iust preceding the actual 
message Tins song is played at 7 55 p xr just before the station break, and 
is used as background music for the announcer concluding the program 
for that half hour When this is done, comment on the complexity of the 
decoding equipment that must be as affable to the agent m the field. 

SECOND NOTE 
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DIRECT SIMPLIFICATION 
METHODS 


As we saw in the last chapter, the systems analyst must have at his dis- 
posal at least as great a variety of possible actions or resources as the 
system he wishes to control. This requirement raises the important question 
of system simplification, because quite often the analyst will not have the 
requisite resources to define, analyze, and control the system he confronts. 

Indeed, the act of system definition itself raises this question, for in 
selecting the elements and relationships he will use for his system in a 
given case, the analyst will always face that infinity of possibilities we dis- 
cussed in Chapter 1. 

What is the analyst to do? As we remarked in Chapter I, we have no 
universal theory of systems simplification that can be used to select elements 
and relationships in a particular problem. We can outline some useful 
approaches, however, which will arm the analyst with at least a strategy of 
systems simplification. This chapter and the next will outline a few such 
strategies. 

THE COST OF SIMPLIFICATION VERSUS THE 
COST OF PROCESSING ABILITY 

The meaning of "simplification” has changed with our ability to process 
information swiftly and inexpensively. For many years investigators at- 
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tempted to create elegant theories, descriptions, and systems definitions that 
relied upon few assumptions or that required few steps to complete a demon- 
stration in logic or mathematics. The aim teas to obtain a neat, apt. 
fastidious selection. (The Latin root of elegant means to choose, or select.) 
This was a reasonable objective in view of the Jater difficulty in making 
manipulations with “inelegant" models .Man-years and lifetimes were 
devoted to the creation of more elegant proofs, more elegant theories, and 
more elegant methods of analysis. We are still Impressed by the Principle 
of parsimony, expounded in the fourteenth century by William of Ockham. 
He suggested that, when in douht, we should accept those conclusions that 
follow from the least number of propositions, assumptions, and steps In 
reasoning.* 

Such a philosophy is inappropriate when the costs, time, and resources 
required for manipulating relatively cumbersome definitions, data, or pro- 
cedures can be greatly decreased by using electronic computers and similar 
devices. If \vc take into account the total effort of defining a system and 
analyzing it, we usually find that the optimum approach is to treat the 
research process itself as a system and to balance the effort expended in 
defining the system with the corresponding effort and cost of analysis. By 
approaching the system in this way, many forms of system definition and 
analysis that would otherwise be inelegant are, when modem data-proecssing 
equipment is available, the most efficient ones. Thus, a lengthy step-by-step 
algorithm (or set of rules for solving a problem), although mathematically 
inelegant compared to a “simple" formula, may be more efficient than the 
"simple” formula when a computer is used for the analysis of a system 
problem. The listing or classification approach to system simulation dis- 
cussed in Chapter 4 is an example. It is aesthetically untidy, but highly 
efficient. 


From another viewpoint, the drastic elimination of detailed assump- 
tions, computing steps, and relationships to be handled can be harmful if 
we enforce simplicity for its own sake. The concept of elegance conflicts 
wrth the facts of life in many systems, which can be inherently messy. 
To eliminate such messiness” in the name of simplicity destroys one of 
the Mental characteristics of the system to be defined. The required 
prahebon or control ability may be lost by stripping the burden- 

some filigree. 7 

be J h l P ,?’ 0Se l s ™P“ ficatiOT discussed here is to create a balance 
behveen the variety of the system to V handled by the analyst and the 
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We should also distinguish between intellectual and physical simplifica- 
tion. Simply because an analyst views a system in more or less detail does 
not mean he must necessarily engage in a physical simplification of the 
system (although this may be the result of his investigation). The two ways 
of eliminating detail should not be confused: Because the housewife simpli- 
fies her view of an automobile to a key, an accelerator, and a brake does not 
mean that she must have the motor removed. We will now discuss such 
mental simplification. 


DIRECT SIMPLIFICATION METHODS 

For our discussion, we can divide simplification methods into two main 
categories: First-order methods, which directly reduce complexity; and 
higher-order methods (see Chapter 8), which simplify a system indirectly, 
through a series of steps. This split conforms to our discussion of system 
hierarchy in Chapter 5. 

The Strategy of Direct Simplification 

Direct attempts at system simplification usually involve the actions of 
elimination and grouping, either of which directly decreases distinctions 
that need to be made in a system definition. Let us first consider a few 
general examples that illustrate the strategy and then examine some specific 
methods. 

In defining a system for a given purpose it soon becomes apparent that 
large blocks of detail are irrelevant and should be eliminated. Such elimina- 
tion implies the analyst has imposed implicit constraints in his system defini- 
tion. In fact, no system definition could be made if constraints were not 
introduced, because without constraint the system would have no structure. 

A set of such constraints, if increased, acts as a filter to narrow the 
set of possible elements and relationships that will be considered. To illus- 
trate, consider again the research team of Chapter 3, which could take on 
projects of type A or B. 

Example: If no limit were placed on the number of each land of project 
the research team could take on, there would be an infinite number of 
combinations of A and B type projects allowed. By restricting the total 
number of projects to three or less, however, a limit of ten possible 
combinations results. The stated restriction filtered out, or eliminated, all 
other possibilities. 

To continue with the same example, suppose we also required the 
research fpam to have in process at least one project (of either type) at 
all times. Then, the state (0,0), which corresponds to complete idleness, 
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would be excluded, and the number of combinations allowed would be 
cut to nine. Similarly, if we required the research team to have in process 
at least two projects, but not more than three of either type, the number 
of allowed combinations would be seven, and so on, as may be seen from 
the two-dimensional state diagram of Figure 3-8. 


The need for careful specification of constraints should now be clear. 
Without any constraint, the system to be considered becomes impossibly 
large, because it would then include all elements and relationships that even 
remotely bear on the objective of the system definition. On the other hand, 
it is also possible to constrain the system so that no elements or relationships 
remain. 


Thus, in the example above, if we required the research team to work on 
three jobs or less, and at the same time required the team to have at least 
four projects of type A or four of type B, no state would satisfy the require- 
ments and the restriction, and the system could not be defined. We would 
say that the constraints chosen were inconsistent, because no “feasible" 
system would result from the specification. 

, C J'° Csi,,S ™ ilab,e constraints reduces the complexity of a system 

by^‘li‘mLto“ USe the <toU ” Ct “ >nS nMde to thal *&“«<» can be reduced 

Grouping system distinctions also directly simplifies a system definition. 
Example: Ag am, i„ the example of the research team suppose we 
roedfeta*. between projects of type A and tj^c B^and just 

will leave fewer dhtLioTthtbefe; ^ ‘"Trte t * 

considered. * * w0 tensions (A and B) were 
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gun control system is constrained by the 



Direct Simplification Methods 159 


physical construction of its weapon, and depends upon the fact that targets 
clo not move at perfect right angles, A certain range of temperatures and 
pressures is required to cause a chemical reaction, A driver must stay on the 
road, maintain a headway greater than zero, and not run out of gas. 

Similarly, we find herds of animals, clusters of people, banks of machines, 
political factions, component sections or modules, repetitive tasks, and 
organizational departments that naturally suggest not individual notation or 
distinction, but group treatment. 

When such natural constraints and groups appear, they may be ex- 
ploited to reduce the variety in the system to be defined. Such constraints 
and groups are not only helpful in reducing system variety, but are also 
an essential part of the system deBnition. 

■'Artificial" and “arbitrary’' constraints and groups 

In addition to natural constraints and groups, the analyst may choose 
to introduce those of his own making. Thus, the analyst may wonder what 
will happen if he constrains all production equipment in a shop to work at 
60? efficiency. The architect (or his client) may prefer colonial architecture 
to all others. A driver may want to travel from New York to Chicago, but 
not use a turnpike. In the same way, we may arbitrarily create artificial 
classes and groups, stratify data for ease in computation, or eliminate 
distinctions by reference to a higher, broader mode of classification. Do we 
really need to know the number of males and females in Louisville, or 
would knowing the total population be sufficient? Couldn’t we deal with 
small, medium, and large, rather than with the exact garment size? Would 
it not be possible to report the output of the department in dollars, rather 
than by dollars for each distinct product line? Why not build in modules, 
or blocks? 

If we follow this same train of thought, our next question might well 
be: “When do we use elimination methods, and when do we group?” 

From the examples above, and those to follow, we see that elimination 
methods, which, result from, the application of constraints, reduce detail by 
reducing the area, region, or set of points to be investigated or contained 
in the system definition. In this way, the system definition includes elements 
and relationships that are important within or at the boundary of the 
constrained region. Thus, the resulting system definition is useful for de- 
tailed investigation and analysis of variations within or at the border of the 
defined region. We simplify by elimination when the system objective re- 
quires optimization, isolotion, and search of detailed action. 

Conversely, it may be desirable to make comparisons or tests between 
large blocks of elements and relationships. We simplify by grouping, classi- 
fication, and consolidation of detail when the system objective requires esti- 
mation, comparison, and test between blocks of detail. 
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Some combination of grouping and elimination is the rule in most 
system definitions, but the distinction between the approaches is strategic 
when we plan direct simplification procedures. 

With this background, we will now consider some detailed examples of 
elimination and grouping techniques. 


Elimination Methods 

Many of the approaches to elimination and grouping ate similar, but for 
our present purposes we will discuss three general methods of elimination 
currently used ( l ) Restricted ranges of measure, or interest, ( 2 } logically 
or statistically restricted combinations, or patterns of acceptance, and (3) 
threshold and discrimination methods. 

The first approach seeks to put a small box about a continuously described 
area, which will be of later interest The second approach attempts to 
reduce the number of points that must be considered within a box. And, the 
Unrd^ approach,^ usable only when observations can be ranked or scaled, 
uses adjustable elimination, so that the analyst can experimentally partition 

T Sma5kr Subsystems ‘ approaches may be used 
alone, m combination, or in conjunction with other simplification procedures. 

Restricted ranges oi measure or interest 
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7-1 The output ol a system * as a function of Inputs x and y. 

know this much, our search for the maximizing values of x and tj is 
greatly simplified, because the search area is restricted, and alternate, 
nonglobal, maxima are excluded from consideration. In short, if you are 
looking for the highest mountain in the world, to know that it is some- 
where north of India is helpful. 

In many systems major discontinuities in system performance occur 
as system variables move from one level to another, even though the 
variables themselves may be continuous. 

Example: The familiar break-even chart shown in Figure 7-2 may be 
linear until the capacity of one-shift operation is reached. When two- 
shift operation starts, added fixed costs of supervision and commitment 
j'ump at the limit of one-shift volume. Systems of this type are simplified 
if they can be considered piece-by-piece, thereby avoiding jumps and 
discontinuities in the restricted definition. 

Similarly the area of system interest may be limited by trial 
boundaries, which limit the area within a larger system for which a 
detailed system definition is required. 

Example: A production problem arises concerning poor output quality. 
The industrial engineer for the shop obtains a flow layout, showing the 
position of the machines on the factory floor. The engineer then checks 
off areas on the floor, or major blocks in the process that do not appear 
to be the source of trouble. He then carefully investigates the area re- 
maining within these trial boundaries. 

Thus, one direct method of system simplification is to consolidate the 
area that needs further detailed system definition or search. 
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7-2 A breakeven chart showing a jump at the limit of one-shift capacity. 

Logically or statistically restricted combinations 

System simplification can also be obtained by restricting attention to 
specified combinations of conditions, or to patterns of operation of interest 
to the analyst, and by eliminating all others. (Elimination of the unwanted 
patterns is equivalent to selection of the wanted patterns: The approach to 
the final result is a matter of convenience.) 

In many sequential or combinatorial system problems, most possibilities 
that could he considered can be eliminated became they are impossible or 
illogical on technological, procedural, or similar grounds. 

Example. A production operation consists of assembling a bolt (B), a 
lockwashet (L), and a flat washer f\V) in that order, as shown in 
Figure 14. Theoretically, the number of possible sequences of three 
things is 3’, or 6. But, on physical and technological grounds, only two 
of the possible sequences are suitable. We can assemble LAV, and then 
insert B, or, we can reverse the process, holding B and assembling L and 
W on B. All other combinations are either physically impossible or 
contrary to specification. 

Example: A sequence of order -processing operations involves (1) re- 
ceipt of mail, (2) order editing. (3) credit check, and (4) product ship- 



BLW J 
BV/L X 
Y/Bl X 
LBW X 
LWB X 
Y/LB J 


7-3 An »«wbty cpefaiion. 
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ment. Theoretically, the number of possible sequences is 41 or 24. Because 
none of the operations can start before the mail is received, and because 
a credit check is needed before product shipment, the remaining prac- 
tical possibilities concern only whether to provide a credit check before 
or after order editing. There are therefore only two real possibilities, 
not 24. 

If a large set of different system elements or relationships, each of which 
is described in many dimensions, must be simplified, a smaller set can be 
obtained by sorting out, or extracting, those items with the desired com- 
bination of dimensions. Moreover, by increasing the number of dimensions 
in the sort, the simplified set may be reduced to any desired size. 

Example: From a deck of 52 playing cards, we may extract 26 that 
are black, 13 that are spades (and black), three that are black spade face 
cards, and only one black ace of spades, (There are no blue cards, so 
that it is possible to specify a dimension, or set of dimensions, that will 
produce no set members. In general, however, an increase in the number 
of dimensions required in the extraction will decrease the number of 
items extracted.) 

Example: A personnel file consists of 1000 names. If we ask for male 
employees, we extract 600 names. If we ask for males over 40, we get 
300. If we ask for males over 40 who have attended four years of college, 
we get 50. And if we ask for males over 40 who have attended four years 
of college and also speak French, we get 2. 

It is frequently useful to relate items extracted in such a logical elimina- 
tion procedure to a space or time scale, using maps, graphs, or charts. Be- 
cause highly specific human and physical characteristics tend to cluster in 
place or time, the mapping procedure may suggest added dimensions, or 
combinations for extraction, that will aid in further system simplification. 

Example: A criminal investigator wishes to define the system of opera- 
tions of a criminal who has committed robberies in a given mode of 
operation in small jewelry stores on several occasions. The physical loca- 
tions of the crimes, when plotted on a map, are found to cluster in a given 
locality and always to occur on one-way streets. The investigator now 
narrows his field of interest to small jewelry stores on one-way streets 
in the area, simplifying his stake-out problem. Moreover, by searching 
the historical files for similar crimes, he can extract a smaller list of 
suspects than he could without the one-way street dimension. 

Many systems will be simplified if their operation is considered only at 
the logical extremes of activity. 
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Emm Bfc. Given a complicated equation for a system description, a 
mathematician might first ask “What happens when * tec™* ‘ 
large or very smaU, « or OT In this way he can bypass much of the 
complexity of the equation and greatly simplify his approximations. Such 
approximations give the limits of system operation under extremes oi 

V Similarly, the mathematician might ask “Under what input conditions 
will the system output he very large, or very small, » or 0? In this 
way simplified approximations may reveal critical combinations o! 
inputs that can cause extremes in system output. For example, in the 
feedback equation [Ojlx) = k/(l + i*)l, when the product of system 
amplification and the feedback factor equals minus unity, the denomina- 
tor becomes zero, and the system “explodes" or breaks into oscillation. 
The point at which such breakdown occurs may be the main point of 
interest in die system definition. 


Using this same form of argument, we can show that some systems 
operate under optimum conditions only at the extremes of activity set by 
specified constraints. The possible or feasible modes of system operation in 
these systems may be numerous (or infinite), but the number of "optimum 
combinations of conditions is drastically limited, even for highly complex 
systems. Linear programing is a specific method for exploiting this method 
of system simplification. 


Example: Figure 74 shows a simple system with two inputs x and y 
representing the units of two products that may be manufactured in 
a given shop. Both products require time on both machines A and B, 
and, because time on machines A and B is limited, an increase in X 
means a decrease y, and vice versa. The equations defining the machine 
restrictions are shown in Figure 74, and assume 400 hours available 
per week for each machine type. The "feasible area,” representing pos- 
sible combinations of t and y, shown shaded, is a consequence of the 
hnear restrictions imposed by machines A and B. The assumption, of 
course, is that to complete two units of x (or y) takes twice as long, or 
consumes twice the specified machine time, as the manufacture of one 
unit of x (or y). 


Suppose we obtain $50 profit for each x and $75 profit for each y, 
which is a linear measure of system output. The point of maximum profit 
will he on the outer perimeter of the feasible region, a logical con- 
sequence of the straight line geometry defining the system. The optimum 
conditions for the product mix of x and y may be obtained by progressive 
0f *!. con «® s «* feasible region (maximum profit of 

^°- 3) by substil “tion of the comer values of 
x aod y fa u,e object, ve tactic' = =50, + 75y at each of the possible 



Direct Simplification Methods 165 


y 



7-4 A simple linear programing example Illustrates the principle of logical elimination. 


comers of the feasible region. The Simplex Method of comer evaluation, 
which indicates the order in which comers should be evaluated, further 
limits the number of comers that must be evaluated . 2 

The important point here, however, is that the system of possible 
combinations of x and y has been immeasurably simplified by exploiting 
the geometry of the linear system in combination with the linear objec- 
tive of optimization. Moreover, by changing a constraint, the set of 
possible optima can be further reduced. Suppose the time available 

'The Simplex Method is an algebraic algorithm, or set of procedural rules for compu- 
tation, for finding the optimum solution to a linear programing problem of the type 
illustrated in Figure 7-4 The virtue of the method is that it is computationally efficient, 
produces the optimum (if there is one) in a finite number of steps, and requires fewer 
evaluations of extreme points than a complete enumeration and evaluation of all possi- 
bilities. The method is described in J- G. Kemeny el al.. Finite Mathematics tcith Busi- 
ness Applications, Prentice-Hall, Englewood Cliffs, N.J., 1962, pp. 384-392. A com- 
plete treatment, with theory, may be found in R. P. Dorfman, P. A. Samuelson, and 
R. M. Solow, Linear Programming and Economic Analysis, McGraw-Hill, New York, 
1958. The Simplex Method provides a general solution to problems having linear con- 
straints and a linear objective function, and is not limited by the number of constraints 
or variables. In particular problems and computational tasks, the number of variables 
or constraints imposed may be limited by the capacity of the computing equipment 
available, not by the procedure. 
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on machine A is increased to 1000 hours, with the resulting restriction 
shown by the dotted line in Figure 7-4. The number of comers of the 
feasible region is now reduced from four to three, so that the search 
for optimum conditions can be correspondingly simplified. 

The importance of linear programing as a logical form of elimination is 
greatly enhanced as the complexity of the system increases, and conse- 
quently, this approach is often used in routine searches for optimum condi- 
tions in linear systems with several hundred variables and thousands of 
restrictions. 


Statistical sampling procedures also oSer a way to filter system elements 
and relationships, while retaining the basic pattern o! system operation. 

When we draw a random sample from a population, we select a portion 
of a larger set (eliminating the remainder) so that each member of the 
sample is drawn from the population with equal (or blown) probability. 
The members of the sample are then representative of the population, 
or larger system, and may be used to simplify further definition, because 
we can now deal with fewer representative elements and relationships. 
Example: Suppose the system elective is to find the minimum cost se- 
quence of operations when 100 jobs must bo sequenced on one machine. 
The number of possible sequences is 1001, a number too large to evaluate. 

comtortedV? 0 * a Samplt; ot 1000 > ,0!siUl! “ourancs is 
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possible transactions in the paperwork system would be complex. How- 
ever, if \vc do not consider transactions that occur less than ten times 
a week— because they are less frequent than the threshold set-the major 
flows in the daily paperwork will stand out clearly, and the system of 
flows will be simplified. 

The ability to use threshold discrimination methods successfully in 
Systems simplification is based upon several facts, observable in most systems. 

First, as was the case in our mapping of clusters produced by logical 
elimination, clusters of frequency, intensity, size, or other measure are 
usually found in most systems. 

Second, the efficient isolation of such clusters by threshold methods is 
a result of the highly nonlinear distribution of most system measures 
throughout the system to be simplified. For example, if there were an equal 
transaction frequency for all types of paperwork in the above example, 
simplification by invoking a threshold would not work. But, distributions 
of tliis type are not uniform; they are highly skewed. A few, frequent, large, 
important events, transactions, variables, or relationships and many of 
lesser measure exist.* 

Finally, as a consequence of the clustering effect and the nonlinear 
distribution of system activity, it is possible to use threshold techniques 
not only to simplify a system by eliminating many elements and relation- 
ships from consideration without materially affecting the major patterns 
of system operation but also to split, or partition a larger system into a 
number of smaller systems which thereafter may be treated independently. 

Because threshold methods in system simplification are important, some 
evidence of the distribution of system measures is of interest to experienced 
systems analysts. A few examples will clarify the generality of threshold 
application. 

Example: From studies ranking words in English (and other languages) 
by their frequency of use, we find a distribution that declines rapidly 
(in a roughly “exponential” fashion). One hundred short words (a, and, 
the, when, . - constitute 5QJ of typical English text. Another few 
hundred words constitute the next 30?, and so on, for all the words in 
the dictionary. 

This fact, investigated extensively, by G. K. Zipf, 4 provides a basis for 
proposals for a Basic English vocabulary so that language instruction may 
be simplified. Speed courses in typewriting and shorthand, and training 


* Accordmc to General Maxwell D. Taylor’s records { The Uncertain Trumpet, Harper & 
Row New York, I960, p 91), between October 6, 1955 and March 31, 1959, the 
Joint Chiefs acted upon 2977 issues. Of these, 2954 issues were unanimously agreed 
Upon, thus, only 23 were left in disagreement. These, however, were the “blue chip” 

^issues. | jumcn Behavior and the Principle of Least Effort, Addrson-Wesley, 

Cambridge, Mass, 1949. 
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in similar repetitive jobs have often been structured to exploit this fre- 
quency distribution. 

The nonlinear distribution of word use is typical of a wide range of 
systems involving human actions that exhibit what Zipf calls “The 
Principle of Least Effort." Zipfs law states that individuals tend to minimize 
their expected expenditure of effort, thus producing the observed nonlinear 
distribution of activity. 

Certain activities are performed more frequently than others— and, in 
general, the most frequent activities are those the participant estimates to 
be the easiest. Whether this activity distribution is due to habit, physical 
factors, or other causes, the experimental evidence remains. Wc take many 
more short trips than long ones, place orders for a few items more fre- 
quently than orders for many items, tend to buy a few things more 
frequently and in greater volume than others, tend to live near people 
with whom we frequently do business or converse, make more short tele- 
phone calls than long ones, and talk to the same few people most of the 
tune. Most people even find their mates within a few blocks of their resi- 
dence. The distribution of almost any measure of human activity falls off 
as the amount of effort” increases. 
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7-5 The nonlinear distribution of Item use fn a typical Inventory list Is an example of 
Ziprs Law. In A-B-C systems of Inventory control, different control methods are used 
for high, medium, and low use Items. 

outnumber such serious crimes as murder and grand larceny. A threshold, 
or priority is established so that attention is first directed to the few 
crimes that are the most serious offences against society. 

In New York City, for example, police do not begin a detailed investi- 
gation of a burglary until the theft exceeds $10,000 and in most opera- 
tions an emergency code is assigned each call to give a priority to calls 
currently outstanding.* In this way, activities that are less important 
to the system at a given time (i.e., activities below the “seriousness 
threshold’’) are screened out. Action is taken on the few serious crimes. 

Example: The design and analysis of power, transportation, and com- 
munication systems is often based on “peak load” requirements. We 
know that, if the system can handle the few peak loads, it can also 
handle the lesser demands of off-peak times. Thus, major attention is 
placed on a study of the few peak load conditions, and on actions for 
controlling or altering the few peak requirements. 


‘Susan Black, “A Reporter at Large: Burglary,” The New Yorker, (Part I) Decem- 
ber 7, 1963, pp. 63-128, (Part II) December 14, 1963, pp. 89-152. This fascinating 
report provides many examples of criminal modes of operation which illustrate cluster- 
ing, threshold effects, and the observation of constraints in burglaries by both criminal 
and police. 
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In human activity, it is often pms.ble to exploit the nonlinear totribu- 
honof activity, or dusters of activity intensity to create system simplification. 

Physical measurements and sample estimates also exhibit 3 we ' * 

nonlinear property that can be exploited by threshold methods. The field 
of statistics is largely concerned with this fact. . A 

For example, the Central Limit Theorem in statistics 1 states that the 
averages of sample measurements taken at random from a given population 
will tend to be “normally distributed" about the true mean of the popula* 
tion. The normal probability distribution (like the exponential distribution} 
is nonlinear, although symmetrical. The probability of extreme deviation 
of a sample average from the population average is much smaller than for 
a small deviation. Tins fact is exploited in the statistical control chart. 


Example: One example of a statistical control chart is the x-bar chart 
used in industrial quality control work. In this application, measurements 
are made of a product characteristic for random samples of four or 
five units, and the average of these measurements is plotted on a chart. 
Upper and lower control limits are then placed above and below the 
over-all, or grand, sample average, which is the best estimate of the 
mean of the population being studied. The control limits are set such 
that only a small percentage of the observed results would fall outside 
the limits due to random causes. Attention is then directed only to “out 
of limits" measurements. Trouble-shooting and action-taking may then 
be limited to rare, exceptional cases that are most probably due not 
to chance, but rather to some assignable cause. Arbitrary forms of 
“exception reporting" are also constructed this way. 


Similarly, many standard statistical procedures, such as regression and 
correlation analysis, are often used to eliminate variables and relationships 
that do not contribute significantly to system description. In other words. 


TTie Central Limit Theorem, according to A. M Mood, is the most important theorem 
m statistics from both the theoretical and applied points of view. “And it is one of 
the most remarkable theorems in the whole of mathematics. A great many eminent 
muhemauckns <De Malm. Laplace, Cairn, Tchebysheff, Liapoundoff. Levy, Cramer, 
and others) have contributed Ic its development. The theorem is this: 

1/ a population has a pna? variance and mean then the distribution of the 
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fi !s IS variance be Emte is not a critical 

SSS the ranee of is concerned because in almost any practical 

sanly be finite “ / F rom w hich case the variance must nects- 

A M. M«d andFTcSS^ f ° l f* ™T J of Suaftt{a ‘ 2nd rf, p. 136, by 
York. Used by permission.) * CopyTJeht I963 - McGraw-Hill Book Company, New 
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these techniques eliminate factors below a statistical significance threshold, 
which is set by the analyst 

Example: A marketing manager hopes to build a "model" that nail 
predict sales of his firm. He has data on monthly sales and also data on 
several other possible factors, say, box-ear loadings, construction starts, 
and the like. He may propose a model of the form 

y — a x x t 4- tfjjr, + <?,x, -f -*• -f- 

where y represents his monthly sales, the a'% are constants (to be deter- 
mined), and the x's arc the values of the available statistical factors. If 
the statistical regression analysis shows that a given ax term does not 
contribute significantly to the value of y, then that factor will be 
eliminated. Because formal statistical methods, such as regression and 
correlation analysis, are easily automated for computer evaluation, we 
can often use such elimination methods to develop predictive system 
models, when we have volumes of historical data. 

Many physical systems, because of their design or method of construc- 
tion, also produce nonlinear measures that may be exploited by threshold 
methods. Such results can occur, because a system is designed to work with 
varying efficiency over a range of input conditions, because of a few in- 
stabilities, or because of isolated design or construction weaknesses produce 
malfunction. 

Example: Hardware systems* are usually designed to work at their 
best under their conditions of most frequent use. A common practice is 
to eliminate the many inefficient operating conditions by setting thresh- 
old specifications above and below which a system should not he 
operated, and conversely to simplify design by making these limits as 
narrow as possible. 

Example: Hardware systems also often have a few specific conditions 
under which operation is unstable, within the desired operating range. 

If the few serious instabilities can be screened out by threshold methods 
and stabilized, the system will usually work satisfactorily under the 
other required conditions. 

Example: A television repairman receives a portable television set of a 
certain make that has the following symptoms: shrinking picture, inter- 
mittent static, and flashes on the screen. He immediately investigates, 
by inspection and test, the operation and condition of the selenium 

* We often call mechanical and electrical systems used in Industry and in military 
service (eg, computers, electronic control systems, etc.) hardware systems to dis- 
tinguish their physical equipment from the organized instructions and data flows handled 
by them, which are called software systems. 
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rectifier in the low- voltage power supply section of the set. He does 
this because of his previous experience with such equipment, or from 
a manufacturer-supplied trouble-shooting list (based on probabilities 
of collected histories), relating the symptoms with the most frequently 
occurring causes of the specified trouble. 

Now, we have seen how threshold methods can be used to eliminate 
the many system distinctions that do not exceed a set threshold. But what 
happens when the threshold level for a given measure is applied to oil 
the elements or relationships in a complex system? 

As the acceptance threshold is raised for a given system measure, many 
elements or relationships, important at finer description levels, arc eliminated. 
For example, by raising the threshold on a relationship measure, we discard 
the connecting links between system elements as we move to higher thresh- 
old levels, and disconnected groups of elements and relationships remain. 
When this effect can be produced, the disconnected subsystems may be 
treated as if they were independent, thereby greatly reducing system 
complexity. 

Example: The traffic patterns between five cities have been studied, 
and die number of trips per week between each of the five cities has 
been tabulated, then mapped as in Figure 7-6. The complete tabulation 
of two-way trips involves five elements and twenty relationships (if 
traffic within the same city has been omitted). Suppose these data are 
Screened by a threshold, so that only traffic flows that exceed 1000 trips 
per week in a given direction are shown. The new map is not only 
greatly simplified, but also broken into two parts, representing two 



7-8 The traffic pattern between 
a week, or more, the dotted tin es 
trip relationships are eliminated 
pendent systems. 


five cities. The solid lines in (a) represent 1000 trips 
represent (ess than 1000 trips. St the less-ttiarviOOO- 
as in lb), the larger system separates into two mde- 
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i^ajor traffic patterns. At the new level of discrimination, the two 
patterns may bo considered independent, and treated as two simpler 
systems. 

In numerical data displays, we can accomplish partitioning of this 
type simply by rounding or truncating to fewer significant figures. 

Note also that more complex forms of threshold applications can be 
applied when a measure contains several dimensions. We can apply 
selected thresholds to each dimension of such measures to achieve a 
desired effect 

Example: In radio communication work using amplitude modulated 
signals, such as those heard on the common table radio, the signal heard 
may be characterized by two dimensions: frequency and amplitude. A 
range of radio stations across the dial may thus be illustrated as in 
figure 7-7, which i« a phi of amplitude versus frequency at a given time. 

In the usual home receiver we tunc the set to a given station by 
using a form of “slit discrimination,'* which effectively eliminates sta- 
tions above and below the frequency to which the set is tuned, yet 
passes the signal within the slit, as shown by slit A. It is also possible 
to discriminate in terms of amplitude. For example, if we establish a 
lower threshold on the amplitude, we could eliminate low-level back- 
ground interference. Similarly, if we establish a higher threshold, we 
could eliminate high amplitudes that produce violent static crashes. 
This complicated set up would allow us to hear only signals that passed 
die conditions defined by the small box at the intersection of the vertical 
frequency slit (A) and the horizontal amplitude slit (B) shown in 
Figure 7-7. Most professional communications receivers have controls for 


A 



7-7 the representation of an amplitude modulated radio signal (the wiggly line) by 
the dimensions of frequency x and amplitude y. 
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adjusting the width and position of such discrimination slits inde- 
pendently, so that only the desired signal may be heard. Although this 
procedure is performed electronically in the radio receiver, the same 
process can generally be implemented by adjusting statistical ‘’slits” for 
acceptance and rejection. As before, the process is more effective if the 
data are widely variable and nonlinearly distributed for the dimensions 
used. 


Grouping Methods 

Let us now discuss the second form of direct simplification— grouping 
methods. We again cite three major approaches: (1) Statistical grouping, 
(2) logical and arbitrary grouping, and (3) the use ol prototypes, or a 
multidimensional cluster of characteristics or measures, to represent a larger 
complex group. We shall see how the prototype (as described here) may be 
used as a building block in more complex system definitions. 


Statistical grouping and the use of descriptive measures 

" clmi< I ue ri Ad description of mass data by 
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Logical and arbitrary grouping 

We can simplify a system by defining certain observations, which may 
be distinctly different, so that they belong to a more general class. In effect, 
we thus gain simplification by a reduction In dimensionality. 

Example: An automobile fleet owner wants to get a quick picture of 
the maintenance problems of a new fleet of cars for comparison with 
previous fleets. In the description of the system of new cars, such 
problems as cracked distributor, broken timing chain, failure in high- 
tension lead, replaced points, replaced plugs, replaced condenser, re- 
placed coil, and the like are all grouped under the category “ignition 
problems"— a logical collection. Although he may later wish to explore 
the details further, the fact that one fleet type has a much greater per- 
centage of ignition problems than another may be all the information 
currently needed by the fleet owner. 

We may also achieve simplification in many cases if we arbitrarily 
convert a continuous variable to a discrete variable, which may assume 
several levels, or attributes. 

Example: A list of customers giving the exact dollar amount of pur- 
chases made during the past year is being prepared. Each customer 
is ranked by the exact number of dollars-and-cents he has spent for 
purchases. Because there are 1000 customers, the list is sizable. A simpli' 
fled approach would be to classify the customers arbitrarily into small, 
medium, and large purchase categories, then display the number of 
customers in each class. 

An extension of the variable-to-attribute method of simplification is 
to use "zones of indifference.” By blocking unimportant variations we get 
approximate, yet greatly simplified, economic computations, simplified dis- 
plays of complex data, and simplified systems definitions. 

Example: An analyst begins a study of truck routing, only to find a 
wide range of trip lengths to be considered. Thus, a vast amount of 
detailed data must be collected on the cost of a distinct trip length. 
Suppose the analyst finds his resources inadequate to the task of such 
detailed observation, and moreover, estimates the results of his study 
would not justify the cost of processing such detail. He may therefore 
decide to divide the area under consideration into ten cost indifference 
zones that will estimate roughly the cost of trips from the main ware- 
house. The resulting simplification— similar to the cost zones illustrated 
by a parcel post map— allows the analyst to make an appropriate cost 
comparison of alternate dispatching rules. The analyst can adjust the 
cost of data collection and processing either by the precision of his 
analysis, or by increasing, or decreasing, the zone widths. 
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The use of indifference zones may be extended to the construction of 
contour maps or indifference curves that relate a category or class of measure 
to xy or other coordinate systems. Such displays are familiar in topological 
maps, showing spaced contour lines of equal depth; weather maps, 
showing isobars, and economic diagrams, showing iso-profit, or iso-cost 
lines, and the like. 

Example: The use of lines of value (iso-lines) produces an area of in- 
difference in which variations within the band between two lines are not 
detailed, but may often be inferred. The variation will at least be less 
than the interval between contours. For example, in the map in Figure 
7-8, the detailed distribution of soot fallout in New York City is shown 
at the 80 tons per month and the 70 tons per month contours. Although 



i»«m. 'mTlmcnsto ""s'o! t? 1 T Va '“ 8 01 a system measure- 
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assume that the amount allocated to the strategic bombing force 
spent most efficiently to maximize offensive potential, and that tn 
amount allocated to air defense was spent most efficiently tomcrease 
defensive potential The sacrifice m defensive potential to obtain an 
increment in offensive potential is shown by the “production possibility 
curve in Figure 7-9a. For a larger total budget, the production possibility 
curve would move up and away from the origin, so that we may think 
of a family of such curves, or contours, connecting the maximum efficient 
combinations of offense and defense that can be bought with $B-billion, 
or $2B-bilhon, or $3B-billion, and so on. (The curses shown and their 
scales are hypothetical, the determination of the points on such curves 
would be a matter for detailed quantitative analysis. The equal trade- 
off” contour, however, illustrates the system relationship connecting 
(1) offensiv e target destruction, (2) defensive kill potential, and (3) 
the size of the total combined budget for both operations, with the 
assumption of most efficient use in either example. 10 
We now ask how we might find the "optimal” combination of offensive 
potential and defensiv e potential, as defined above? 

If it were possible to find a common measure, a commensurate measure 
of effectiveness, to scale offenshe potential and defenshe potential as 
desirable alternatives for a given national security level we could plot the 
trade, or indifference curves, representing equally valuable alternatives for 
that given national security level. Such objective indifference curves, for a 
nonlinear system of this type, are usually convex when viewed from the 
origin-as we relinquish more of our defensive potential, we demand propor- 
tionally more offensive potential m return, and vice versa. A family of 
such objective indifference curves may be plotted, representing various con- 
tours or connected points of equal objective value to us, as in Figure 7-9b, 
For a fixed budget of SB-billion, the optimum combination of x and y 
(defensive potential and offensive potential) is the point P, which is on the 
production-possibility curve for SB-billion, and at the same time on the 
highest national security indifference curve possible. 

This example of nonlinear programing does not illustrate the techniques of 
solution, but rather shows how indifference can simplify a system definition.* 1 

’“Hie production possibility curve shown is a nonlinear constraint. Any combination of 
b ^ the . cons ^ « On the curve, each combination .s said to be 

, ‘t “ maezse r » offensive potential necessitates a decrease 
71,6 quest,0 “ of firuLn S ti'e “optimum" efficient combination 
k, P roWa ® “ conceptually similar to the linear programing example of 

M widely spaced, or simplified, lines of indifference. 
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Prototypes 

If the dimensionality of the elements or relationships to be described is 
inherently so large that grouping under a more general category will not 
provide the needed detail, we often use a prototype, i.e., a typical block of 
elements and relationships we can use instead of many distinctions. 

Example: A manufacturer has a number of distributors throughout the 
country. He knows they differ. The New England distributors are a 
different breed from those in the South and West. He wants to devise a 
system of distribution to include these differences, but knows he cannot 
have a different system for every distributor. He asks for a picture of 
the typical New England distributor as compared with the typical dis- 
tributor in other regions. He wants a few prototypes on which to base his 
decisions, not a detailed description of all his outlets. 

Example: The insurance salesman who meets many customers daily 
cannot afford the time to learn every detail about each customer, nor can 
he have a distinct insurance plan and a distinct sales strategy for each 
customer. He looks for prototypes in the individuals he meets. He has a 
few distinct sales plans and approaches to match his current needs. The 
shy, wealthy financial analyst gets one treatment; the extrovert middle- 
class grocer gets another. 

The high cost and time required for data collection compels us to use 
prototypes when the elements and relationships to be identified, described, 
and grouped become numerous, scattered geographically over a wide area, 
or require the services of expert analysts for their description. 


Example: For a six-year study of heart disease The University of Michi- 
gan selected Tecumseh, Michigan (pop. 9500), as a typical U.S. City. 12 
Settled in 1824, Tecumseh has several industries, a typical heartland 
Main Street, and a fringe of farms. In this community, 8600 residents, 
nine-tenths of the population, were persuaded to participate in the 
lengthy medical investigation, which required blood tests, urine samples, 
electrocardiograms. X-rays, and complete medical examinations. To col- 
lect such refined data on a national basis— even for a statistical sample of 
individuals— would have been economically infeasible, as well as possibly 
misleading due to variations in data collecting conditions in different 
field locations. 


Time, July 23, 1963, p. 39. 
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Example: For similar reasons, test markets, used in cxpenme^l market 
research are used as prototypes. To maintain secrecy, to permit the use 
of control locations in experimental design, to cut cost and time, an 
by the nature of the experimental treatment (block advertising), sue 
typical, or prototype, blocks of elements and relationships must be used- 

Frequently we wrongly represent a prototype by a set of 
average conditions or measurements-often neglecting possible instabilities 
or gross variations about the average. Or, we construct our prototypes, 
regardless of the field of analysis, by trying to remember and visualize a 
specific and striking example-* most unforgettable character-that epito- 
mizes a range of elements and relationships, forgetting that the prototype 
chosen for dramatic effect may not be the prototype useful for prediction. 

Because the prototype, or simplified model, is the result of combinations 
of system simplification by elimination and grouping, it is important that 
we conclude this chapter with a survey of the assumptions and pitfalls 
encountered in using the first-order simplification methods discussed thus far. 


ELIMINATION AND GROUPING PROBLEMS 


Elimination and grouping, as methods of system simplification, make cer- 
tain assumptions we should discuss further. We will not repeat existing 
works 11 on logic or statistics, but will review two major data-collection and 
prediction problems: stability and variability. 


“ A basic library list of over 300 references is provided by tbe Committee on the Under- 
graduate Program in Mathematics of the American Mathematical Society, P.O. 
Box 1024, Berkeley. California 94701, covering statistics, probability, logic, and other 
mathematical areas From this list, the reader may wish to select (l) W. Feller, 
An Introduction to Probability Theory and Its Applications, Vol 1, Wiley, New York, 
1957, and (2) E Pawn, Modem Probability Theory and Its Applications , Wiley, 
New York, I960, ( 3 ) A. M Mood, and F. A. Cray bill, Introduction to the Theory of 
Statistics, 2nd ed., McC raw- Hill, New York, 1963, (4) W. Quine, Mathematical Logic, 
rev ed., Harvard University Press, Cambridge, Mass., 1951, and (5) D. Kalish and 
R. Montague, Logic, Techniques of Formal Reasoning, Harcourt, Brace & World, 
New York, 1964 as st anda rd works in probability, statistics, and logic. At a more intro- 
ductory level, J. C. Kezneny et aL, Finite Mathematics With Business Applications, 
PreoUcc-Hall, Englewood Cliffs, N J., 1962, provides an introduction to discrete 
probability problems and other topics of interest to students of business, with flow 
charts and diagrams . P. G. lloel. Introduction to Mathematical Statistics, 3rd ed., 
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Stability 

For our system definition to work-for it to predict system output, when 
given system input— we must assume an underlying stability in the system 
described, regardless of the level of complexity or the degree of systems 
simplification achieved by any method. 

If we could not assume such stability, then the elements and relation- 
ships relevant for our system definition now, would at a later date or at a 
different place, be irrelevant. Like Alice, we would be in a Wonderland, 
where every new event was a surprise. 

The basic assumption of scientific investigation and practical action-the 
assumption of stability-becomes more important as we begin to eliminate 
or group detail: Inherently we assume the few elements and relationships 
remaining in our system definition— the few left after elimination, or the 
few left after grouping-will continue to be representative and predictive for 
our system. Even though our system definition may be a gross approxima- 
tion, we want it to be a robust definition, firm under minor changes in 
conditions, and permanently useful. 

This desired stability may be found in deductive systems or logical 
systems by definition. For example, we may define the fact that when we 
turn on switch A and switch B, light C will illuminate, but not under other 
conditions. From a series of such statements, the logical consequences of 
system operation may be deduced with logical certainty. And, we may find 
some unfailing regularities of this type in nature— in astronomy, in physics, 
in the operation of an electronic computer— if we do not push the precision 
and reliability of our predictions too far. 

But, in all real world systems there is variability and the question of 
“statistical stability” remains . 14 

"Statistical stability implies that causes of variation which can be assigned to specific 
factors have been taken into account or removed from a system, so that the output of 
that system can be predicted within given limits, the remaining variation being "ran- 
dom," in the sense that we cannot find other causes. Although, in any practical experi- 
ment, it is not possible to prove or demonstrate that a given sequence of observations 
is random, a number of tests can be devised to deterawne if the sequence of resvdts 
behaves as if it were random in the theoretical sense of being drawn independently 
and "at random" from a given population, or distribution of possibilities which does 
not change with time or the sequence of observations A basic exposition of this im- 
portant point is given by W. A. Shewhait (\V. E. Deming, ed.) Statistical Method 
from the Viewpoint of Quality Control, The Graduate School, Department of Agricul- 
ture, Washington, D.C., 1939, a reference which I» in many libraries. Although statis- 
tical tests can be used to monitor sequences of data collected for analysis, and al- 
though many standard statistical procedures assume random variation as a require- 
ment for their results, it is true that many applied statistical studies have faded— even 
worse, given totally erroneous results— because experimental probabilities, averages 4 
standard deviations, and other statistical measures have been computed from sequences 
of data which are not statistically stable. Since this error in application occurs so 
frequently, the reader should be ever on guard for its appearance in his wort. A 
number of other statistical errors and logical fallacies are noted in M.R. Cohen and 
R. Nagel, An Introduction to Logic and Scientific Method, Harcourt, Brace & World, 
New York, 1934, pp. 316-332 and pp. 376-403, some of which are summarized 
hereafter. 
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Briefly, statistical stability requires that variation in a system measure 
must be “random" regarding tune or repetition; i.c., a system measure must 
not exhibit trends, cycles, or other "assignable causes of variation unportant 
to our system objective, but unaccounted for in the system defaiBon. 
See Figure 7-10. 
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When such ‘“causes” of variation have been removed or taken into 
account, the “scatter” that remains is presumably due to so many different 
factors acting independently and in combination that we may consider the 
variation random, and the measure used to describe a system element, rela- 
tionship, input, or output, “statistically stable.” 

Although there are many possible tests for statistical stability (such as 
the control chart example, page 170), the most important requirements are 
that the statistical measure used, its range of variation, and its “pattern” of 
variation remain statistically stable, in the sense just defined. 

The stability requirement for the statistically described system cor- 
responds to the “single-valued” outcome requirement for the deterministic, 
or logical system, and is a straightforward concept if the predictive nature 
of a system definition is kept in mind. In short, if a deterministic system 
is not single valued, or if a statistically described system is not statistically 
stable, we must redefine the system until these requirements are met. 

Tho Size of Slatisticai Variability 

In addition to the requirement of statistical stability, if we are to compare 
system characteristics for various times, places, or conditions, we worry 
about the magnitude of statistical variation present in a system measure- 
even if that measure is statistically stable. 

For sharp comparisons we require that the within-group variation for 
the measures to be contrasted be substantially less than the between-group 
variation. We can state this somewhat differently in the language of experi- 
mental design. The variation between payoffs for alternate courses of action 
(different strategies, or test “treatments") must be greater than the variation 
between different subjects tested, or repetitions of the test under the same 
treatment conditions (Fig. 7-11). 

Rather than repeat textbook examples to illustrate the effects of instabil- 
ity and inappropriate within/between class variability, we will study 
several data collection and data organization examples that will illustrate 
the two basic requirements. In the following discussion, the reader will he 
aided by understanding that many possible inconsistencies in field data col- 
lection result in statistical instabilities, for the reasons we are about to 
illustrate. It should also be understood that statistical categories for data 
collection are chosen depending upon the variability of the cases that are 
observed and the comparisons that must be made. These points are 
clarified in the next two sections. 

Consistency and Statistical Stability 

Our classification and grouping of observations and cases should be con- 
sistent, so that a correct statistical count or measure will result for each 
group or class defined. 
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(b) 

7-11 An Illustration of with uvgroup and between-group variations, tn (a) the 
various observations are scattered tightly about alternatives A and B, and the alter- 
natives are widely separated Thus, comparison of the alternatives is easy — every 
example of B is better than A. But in (b), the scatter about alternatives A and B 
is large, and the alternauves are not widely separated Then, some outcomes of 
alternative A could be better than some outcomes of alternative B, and there is no 
clear preference for A or B. 


Obtaining a consistent grouping is difficult because inconsistencies 
can arise from so many sources ( i.e., cases presumed to be alike can differ 
in so many distinct ways or dimensions) that the investigator is unable to 
control all of them- A few of the major trouble areas will illustrate the usual 
precautions we can take. 

If a variety of different individuals and 1m estigators, estimators, or 
measuring instruments must be used in data collection, these sources should 
be checked for classification, estimation, or measurement consistency. Tech- 
nically, standardization and calibration describe the required equalization 
of understanding or reference. 
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The problem of multiple-agreement-even under identical circumstances 
- becomes more difficult as the number of distinct dimensions to be measured 
or classified increases, because the separation of distinct cases into one class 
or another becomes more difficult for the individual or instrument. Even 
in straightforward applications, such as the assignment of code numbers 
to a list of inventory items in stock according to clearly written directives; 
many misclassifications can be expected. Training programs and strict 
standards help overcome this problem, as does use of automated measuring 
and input devices, where this is possible. 

If data to be grouped take a long time to collect, the underlying condi- 
tions giving rise to the observed results may change while the data collection 
is going on. When this happens, the data grouped together will not be 
consistent, because the observations at different points in time come from 
different underlying cause systems. Sequential examination of the data dur- 
ing collection, or preservation of the exact time of the observation for later 
test should be required during lengthy investigations. 

For the same reason, a check of consistent category definitions used for data 
collection and grouping is essential regarding historical data, because the 
exact definition of a class may not have been consistent over time. The 
definition of terms used in government reports, for example, are revised 
periodically to meet changing needs and conditions, and the reconciliation 
of such definitional changes is usually a major, but necessary, task. 

A similar argument, of course, holds for data collected at different 
places. The response to identical observation methods may not be consistent 
in different locations: The question may not be understood, the environment 
may cause an unexpected bias, the observations may not be made at simul- 
taneous times at each location, and so on. 

Example: A computer manufacturer wished to develop a historical 
record of equipment sales and rentals so that this information could be 
used in future planning, A ten-year record was to be collected of all 
transactions, classified by stock number, customer number, terms of 
sale, age o{ equipment, and the like, including foreign sales, rentals, 
exchanges, and trade-ins. Clear, written instructions and personal visits 
by marketing experts of the firm initiated the project, and soon a master 
file began to grow in the home office. It became apparent, however, 
that major inconsistencies were present in the data. Sales were reported 
for nonexistent stock numbers and customers; customer numbers were 
not applied consistently; mergers, “spin-offs,” and similar customer re- 
organizations were not handled uniformly; equipment modifications 
requiring a change in code were not recorded; in describing customer 
types, because each foreign office seemed to have a different classifica- 
tion concept, the classes were blurred, and wandered from year to 
year. As a result, much of the data had to he collected again or completely 
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re-edited (mostly by hand, although the source data had been tons 
fetred to magnetic tape). Tsvo years passed before the master file 
was up-to-date and reasonably consistent. (Minor errors, however, are 
still being found.) More than 60J of the development cost of the master 
file— which grew to nearly one million source documents— was spent on 
establishing consistency and stability chucks, and on studies of file 
simplification methods that would correctly produce comparative sum- 
mary reports and files. Such an experience is common when data must 
be obtained from many people, over a period of time, from many places. 


Statistical tests can check the desired stability of observations over time 
or place, or conversely point out the observations unlikely to be equivalent. 
However, they must be used with a full appreciation for the stability and 
variability of the underlying system. For example, contrary to popular be- 
lief, the application of standard statistical formulas for computing a “sample 
size " does not produce a helpful (or correct) answer unless stability of the 
cause system being sampled can be assumed: There is always a trade 
between too large a number in a sample group on the one hand (to gain 
simplicity and precision of description), and too small a number on the 
other (to assure stability and consistency in grouping), a balance not 
included in the standard formulas. 

Because it is impossible to collect every fact about every system element 
or relationship, the possibility exists that one or more important distinctions 
will not be collected, and drastically different data will be grouped errone- 
ously. Moreover, in many field data collections, it is impossible to return 
to the observation point in time or place to enlarge the distinction list, so 
that the collected data may become worthless, if the inconsistent results 
are found, or, perhaps, worse if the inconsistencies are not found. A strategy 
to avoid this unfortunate impasse is to pretest data-collection methods and 
preanalyze the data collected on a small scale before initiating a large-scale 
data collection effort. 


Stratification and Variability 

When 've choose a category and its boundaries for computing a statistical 
measure (or for logical comparison) we have seen that we should have 
small wjthin-class variation compared to between-class variation, particu- 
larly if we are to compare classes. The same rule holds, of course, in the 
selection of a prototype, or element-relationship block. The more tightly 
variations from (he prototype cluster about the prototype description? the 
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SupI>0 “ Z” wiA crMle 1 P ra, °‘vpe of two villages, using 
ZZt Tab ° 71 !b0WS value and range “tout the 

average for ullages ABC and XYZ. 
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Table 7.1 


VILLAGE 

Average 

ABC 

Range 

XYZ 

Average 

Range 

Age, male 

35 

1 mo -45 yrs. 

42 

25-80 yrs. 

Age, female 

37 

1 mo.— 60 yrs. 

48 

18-90 yrs. 

Income, family 

$5000 

$2000-$ 100,000 

$6000 

$5000-$7000 


Although we might compare the villages by saying the typical citizen 
of ABC was younger than the typical citizen of XYZ, we could not talk 
fairly about a difference in average income for the typical citizen. The 
income within ABC ($2000-$100,000) varies so much more than the 
variation shown between classes ABC and XYZ ($5000-56000), that it 
would be difficult to make a meaningful, or statistically acceptable, test 
for differences in average incomes. Moreover, the facts that some very 
high-income individuals live in ABC, and that income in XYZ is nar- 
rowly distributed about the average, are likely to be very important 
if the purpose of our system definition is, say, selling insurance or 
deciding how to stock a store. Average age and average income would 
not be good prototypes of these villages for purposes of comparison. 

The solution, which increases the complexity of the system definition 
because we have grouped too much, is to stratify the data, or break 
it up into finer comparisons, to take into account the atypical, high 
income families in ABC. 

Example: Returning to the age-income comparisons of the villages, 
suppose we were able to consult the census figures for each, and found 
the distribution of income by age as shown in Table 7.2, which gives 
percent of population in each category. 


Table 7.2 


VILLAGE 

AGE 



INCOME 



ABC 

< 

35 

<$5000 

60 

$5000-9999 

40 

$10,000-14,999 

0 

15,000 

0 



35 

10 

60 

20 

10 

XYZ 

< 

35 

20 

80 

0 

0 



35 

30 

70 

0 

0 


Then, it would be reasonable to stratify or create a class boundary at, 
say $10,000, and represent each of the two villages in a two-way table 
(Table 7.3) summarizing age and income. 
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IW^rtunately, the wse of stratification or deification Mustrated to the 
vfilace comparison is not typical of more complex compansons. Seldom 
» cno u #i ahout a system to select the most effic.cn. «tegones, di- 

mensions, class intervals, or prototypes at the outset. Data may be un 
able, or only partially available, and not in the form desired. Trials and 
experiments are usually necessary to discover class breaVs that will serve 


the system purpose. 

In the village comparison example, we started with a model tnar was 
too simple, based on our variability criteria, and made it more complex by 
adding dimensions until the result was satisfactory. 

For complex systems, where some computing and data-processing facility 
is available, we can work the other way, Le., start with a system definition, 
which for experimental purposes, has too many description categories. In 
such a simplification, the techniques discussed in this chapter may be used 


in combination. 


Example; We want to describe the element and relationship blocks in 
the Ui>. population into which individuals fall for the purpose of predict- 
ing the detailed opinions of the population regarding a political candi- 
date. Initially, we decide on description of a population element in seven 


dimensions, each dividend into the levels shown below. 


Table 7 A 


Dimension 

number 

Dimension 

name 

Number of 
levels 

Levels considered 

1. 

Political party 

3 

Democrat, Republican, Independent 

2 

Religion 

5 

Protestant, Catholic, Jewish, Other, 

3 . 

Region 

0 

None 

East, South, Midwest, Southwest, West, 

4. 

Sex 

2 

Border states 

Afafe, Female 

5. 

Residence 

3 

Rural, Town, \Jiban 

6. 

Occupation 

S 

Professional, White-collar. Blue -collar. 

7, 

Race 

4 

N'onlabor family. No class 

Caucasian, Negro, Oriental, Other 
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The total number of possible classifications is the product of the seven 
level numbers (3X5X6X2X3X5X4), or 10,800. Clearly, add- 
ing more dimensions or levels rapidly increases the number of possible 
combinations beyond reasonable bounds, certainly for visual tabulation 
purposes, and for most data-collection budgets. For example, suppose 
we add preference for candidate A, B, C, or D as dimension No. 8, and 
media exposure to TV, Radio, Newspapers, and Magazines as dimension 
No. 9. Then the number of possible combinations is 10,800 X 4 X 4 
or 172,800. 

Although the storage of this type of mass data is not impossible, it 
would be cumbersome for analysis and prediction, and we would want to 
reduce the number of classes. We would proceed in the following way: 

Step 1: Although all the combinations suggested are possible, there 
are many very improbable combinations, or combinations representing so 
small a portion of the total population that they would contribute little to 
our prediction of candidate popularity. These category-combinations can 
be eliminated. For example, there may be one or two Negro-Jewish-Profes- 
sional-Males in the Rural-South who prefer Candidate C, are Republican , 
and primarily read Newspapers. However, that event is so unlikely and so 
unimportant in the total, that the specific combination is dropped. Continu- 
ing with more enthusiasm, we note that the number of Professional workers 
in Rural areas is so small that the entire block of possibilities containing 
this combination of levels may be eliminated. Proceeding in this way, we 
may reduce the possible number of combinations to a much more limited 
number. 

Step 2: With our now smaller list, we begin to ask questions about the 
usefulness of some of our dimensions and levels-are they really all neces- 
sary? Suppose we know the combination Urban-Catholic-Negro is highly 
correlated (in terms of political preference) with Urban-Catholic-Caucasian 
opinion. Need the racial distinction be made here or in similar types? 
Obviously not, if our purpose is to predict political opinion by extrapolating 
from the opinions of blocks, or prototypes, of population segments. We 
may group the possible combinations, because the variation between them 
is so small in terms of our system objective. For example, we may eliminate 
Race as a dimension entirely, except for the few critical areas where it may 
be important, say Urban-Negro-East. Just as we used our knowledge of 
the census data in Step 1, we would use our knowledge of political science, 
sociology, and past voting data to effect a further elimination of distinctions 
that were highly correlated, by grouping them into a more general class. 
(We would make certain, of course, that the correlation was stable, by 
examining the stability of past history.) 

Step 3: As a final step we would test the prototype blocks of population 
remaining in our reduced system to confirm that individuals represented 
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by the prototype (it) varied only little in their political opinion within 
then specific class, and tended to move in opinion with their category 
neighbors, (b) that there was 3 sufficient difference of opinion and move- 
ment of opinion between the remaining classes, so that further grouping 
would not be desirable; and (c) that the stability of our final class defini- 
tions could be assured in field data collection that would bring to our files 
die opinion trends necessary for updating our simplified system. 14 


♦OTHER DIRECT SIMPLIFICATION METHODS 

In addition to elimination and grouping, at least three other direct forms of 
system simplification find wide use. These approaches are ( 1 ) The substitu- 
tion and transformation of variables, relationships, and parameters, (2) the 
exploitation of system symmetry, and (3) the tabular rearrangement of data 
rows and columns into partitioned blocks. Wc will briefly review each ap- 
proach m turn. ( Many other forms of direct simplification can be found in 
specialized fields. When such methods are found useful in the reader’s work, 
he should note them down as an addition to our present, abbreviated list.) 

♦Substitution and Transformation 

We can often use substitution of variables, relationships, and parameters 
to simplify a system description. These forms of simplification are also 
useful to bnng an unfamiliar model into a familiar form or to ease analysis. 
Example: We often use a simple substitution of parameters to simplify 
equations. To illustrate, suppose we wish to solve the equation 

(P - $)** + (p + q)x + _ o 

r 

in which p, q, u, v, and r are constants, for x. The equation is cumber- 
some, and we may not recognize it as a standard form. However, by 
substituting 


a=(p-g) h = (p + 9 ) c = , 

r 

the original equation becomes 


aj* + bx + c = 0 

“ MrfSS? ITZk ntlvfjZShfci "5 Burdidc, The 4S0. 

calcgonci used by political scientists to anaJvL l< f W ~‘ e 4S0 P°P u ^ tion 

the Kennedy campaign of I960. The Append^ l i ‘ e , Dt ' moCfali c Party in 
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the familiar quadratic equation with the well-known solution 

—b ± \/b 2 — 4ac 
x = . 

2a 

We may now resubstitute the original values, and obtain the solution in 
terms of p, q, u, v, and r. 

- (p + 9 ) ± V(p 4- g) 2 - 4 (p - q)(u + v)/r 
2 (p ~ (?) 

In less obvious cases an ingenious rearrangement of terms and careful 
choice of values for substitution may be needed. Several examples of 
this type appear in the derivations of Appendix B. 

The substitution of variables and relationships according to a consistent 
rule leads to a transformation in which we convert one set of variables 
and relationships into another set. The mathematical transformations dis- 
cussed in Appendix B are examples of this method of simplification. 

The one-to-one transformation, which maps each possible system in one 
scheme to a unique set of systems in another is of major interest, because 
it is reversible, and we can thus obtain solutions in the original framework. 
Figure B-l illustrates this transformation-inversion procedure. 

One of the major questions that arise when we transform a system from 
one scheme to another is which system properties remain invariant under 
the transformation. Thus, if we draw a circle on a plane and then transform 
that plane into a cylinder (by rolling up the page until its edges touch) 
\ve preserve many properties of the original circle. The circles area, cir- 
cumference, and radius will be the same as before if we measure these 
values along the cylinder’s surface. However, points on the circle’s cir- 
cumference are no longer equidistant from a perpendicular drawn through 
the center of the circle. This fact is certainly of interest to a sheet metal 
worker who attempts to fit round pipes together; a round pipe of the original 
diameter would not fit the hole in the cylinder. We would also be interested, 
as systems analysts, in how alterations of the circle in the plane correspond 
to alterations of the circle on the cylinder and vice versa. 

The use of logarithms provides a more direct example. When we take 
the logarithm of two numbers, say 8 and 16, the difference between the 
original numbers is not preserved, e.g., logz 8 = 3 and log 2 16 = 4, a 
difference of 1 instead of 8. Moreover, operations on the transformed 
numbers are transformed, too. The addition of the logarithms is equivalent 
to multiplication of the original numbers. This knowledge is important be- 
cause if we consistently apply the corresponding operations to the original 
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and transformed numbers, the results of those operations will be identical, 
e.g., log* 8 + log* 10 = 3 + 4 = 7, and the antilog of 7 to the base - is 
2- = 128 = 8 X 16. Thus the ability to perform easier operations on the 
transformed system is of great value in simplifying systems analysis. 

We will leave further discussion to Appendix B, but cite a famous prob- 
lem to illustrate the power of the transformation to aid analysis. 


Example: The famous Swiss mathematician Leonhard Euler presented 
the following problem and its solution to the Russian Academy at St. 
Petersburg in 1775.’* In the town of Konigsberg in Prussia there was 
an island called Kneiphof, marked A in Figure 7- 12a, with the branches 
of the nver Pregel flowing about it as shown. Seven bridges, marked 
a, b, c, d, e, f, and g connect the island to three land areas, B, C, D. The 
question. Can a person plan a walk in such a way that he will cross 
each bridge once but not more than once? The townspeople amused 
themselves trying diSerent routes, some claimed it could be done, others 
denied the possibility. Euler solved the problem in general as well as for 
the specific case by converting the map to a Sow graph. 

Let the node of a graph represent each of the four land areas. A, B, 
C, and D. Let the arcs of the graph represent each of the seven bridges 
connecting the nodes appropriately. Figure 7- 12b shows the result. The 
bridge problem is the same as that of transversing the graph from a 
given starting point over each arc only once and returning to the start. 
Call a node or vertex of the graph even if an even number of arcs lead 
to it, and odd otherwise. Euler proved in general that the required 
route is possible only for graphs that have all even nodes. If the graph 
has no more than two odd nodes, all arcs may be covered in one journey, 
but it is not possible to return to the starting point. In general, if the 
graph has 2n odd nodes, where n is any integer, it will require exactly 


Adapted from L Euler, “The Seven Bridges of Konigdxng," reprinted in J R. Newman, 
l he World of Mathematic*, Simon and Schuster, New York, 1950, pp. 575-580. 
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n distinct journeys to traverse it" These general deductions were 
possible by study of the graph, although they were not obvious from 
the map. 

‘Symmetry 

Another important simplification principle is to invoke the concept of 
symmetry, or to exploit symmetric system structure if present. Although 
we use symmetry to indicate “beauty” and “fine form” in common language, 
we refer more specifically to bilateral symmetry, exemplified by nght- and 
left-hand similiarity, mirror images, or the geometrical symmetry of a circle 
about a diameter or a sphere about a plane through its center . 18 

As a simplification procedure, symmetry permits us to extend analysis of 
a smaller part of a system to a result for the system as a whole. Thus, if we 
know a hemisphere has a volume of two cubic feet, we know at once 
that a sphere made of two such hemispheres will have a volume of four 
cubic feet. In systems analysis this principle is important in handling com- 
plex decision trees, probability problems, and structural analyses. The 
device is greatly enhanced if we can find symmetries within symmetries. 
Although extensions of this idea abound, one simple example will suffice here. 
Example: If a stick is broken in half at random, what is the length 
of the smaller piece? Solution: “Breaking ‘at random’ means that all 
points of the stick are equally likely as a breaking point (uniform dis- 
tribution). The breaking point is just as likely to be in the left half as 
the right half. If it is in the left half, the smaller piece is on the left, 
and its average is half of that half, or one fourth of the length of the 
stick. The same sort of argument applies when the break is in the right 
half of the stock, and so the answer is one fourth of the length .” 19 

" For the full proof, which is a simple and direct argument, the reader must consult the 
reference cited. However, to disprove the Konigsberg possibility, we give the follow- 
ing from Euler, which sets the pattern for his later proofs. Suppose you start at A 
and begin to traverse arcs, recording the route as A, B, A, C, . . . , etc. A pair of 
capital letters indicates an arc is crossed, e g , AB or BA indicates that one of the two 
arcs between A and B, is crossed Since five bridges lead to A, the letter A must 
appear at least three times in the sequence describing the route Simdarly, the letter 
B must appear twice, since three bridges lead to B, similarly D and C must each 
occur twice "That is to say,” Euler concluded, “the senes of eight letters that represent 
the crossing of the seven badges (with A as a starting point) must contain A three 
tunes, and B, C, and D each twice, but this is quite impossible with a senes of eight 
letters Thus, it is apparent that a crossing of the seven bndges of Konigsberg in the 
manner required cannot be effected.” Eulers argument and later generalization is 
most easily visualized by inspection of the flow graph. 

’* See H. Weyl, Symmetry, Pnnceton University Press, Pnnceton, N J , 1952, reprinted 
in abbreviated form in J R. Newman, The World of Mathematics, Simon and Schuster, 
New York, 1956, pp. 671—724. 

”F. Mosteller, Fifty Challenging Problems in Probability with Solutions, Addison- 
Wesley, Reading, Mass , 1965, p. 63. This small book contains many problems illus- 
trating symmetry. 
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‘Partitions 

If a system is very large it may be possible to break it into several smaller 
independent subsystems. This principle was illustrated in Figure 7-6 when 
we discussed threshold methods of simplification. However, we provide 
some matrix illustrations here for emphasis. 

Fust, tabular data, particularly data that represent input-output relation- 
ships, may be collected in a form that does not indicate any independent 
structure. Such matrix representations may come from the development 
of sets of simultaneous equations, from observation of system transactions, 
or from other sources. At the outset, the arrangements of the rows and 
columns, set by the arbitrary numbering of system elements, may produce 
a “random" looking result. See Figure 7-13, where an “X" indicates a table 
entry and a blank a zero. 

However, by judicious rearrangement of the rows and columns of the 
matrix, we may arrange the data in blocks, or in practical cases into near 
block form. In Figure 7-13a, for example, the rows and columns are num- 
bered 1, 2, 3, and 4. If we rearrange both rows and columns in the older 
1, 3, 2, 4, we obtain the blocked result shown in Figure 7-13b. This revised 
result would indicate relationships between elements 1 and 3 as a sub- 
system and between elements 2 and 4 as another subsystem. The two sub- 
systems are independent because no entries relate 1 and 3 with 2 and 4. This 
ideal may be approximated in many cases, either by rounding, by using a 
threshold cutoff, or by a row and column rearrangement that recognizes 
larger table entries as more important than small- The rearrangement will 
be more obvious with sparce matrices, i.e., those with many zero entries. 
When perfect blocks of table entries cannot be found, approximate blocks 
will be useful to indicate approximate subsystem divisions and the im- 
portant relations which link them in the over-all system. 

The ability to partition data in this way also has many computational 
advantages. If a table can be broken up into smaller blocks as shown below, 
the segments of the table can be handled separately. When this is done 
it may be possible to handle a very large single computation as a series of 
smaller computations. The ability to eliminate xeros or insignificantly small 


12 3 4 


1 fx x 

2 x 

3 x x 

4L x 

(a) 


7-13 Data arrangement tor partitioning. 
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numbers from the computation reduces computation time and the memory 
space required for storage if an electronic computer is used. Thus, given 
fixed computing facilities, a large "impossible" computation can be handled 
by treating that single problem in stages. 

Example: Consider the multiplication ofalxj matrix by a column 
vector as shown below. 


'2 4 0 0* 
12 0 0 
0 0 3 2 
0 0 12 . 


X 



|10, s, 17, Ilj 


Compare this result with the two multiplications of the 2 X 2 blocks 
partitioned from the original matrix by 2 two-element column vectors. 


[i ®l 



f!7, Ilj 


Tbe original result can be obtained by recombining the two smaller 
results, but all of the multiplications by zero have been omitted, and 
the number of positions used for storage of the problem has been reduced. 

Thus, partitioning has wide use in the analysis of large scale systems, 
both to simplify interpretation and to reduce computation. The study of 
input -output economic relationships and the manipulation of such data 
provide obvious applications, as does the analysis of other forms of trans- 
action and interaction data. (For several input-output readings that will 
illustrate the simplification of large-scale systems defined in tabular form, 
see Problem 7-7.) 

In this chapter, we have discussed a number of direct simplification 
methods. In Chapter 8, we will study system simplification methods that 
may be applied indirectly, or at a higher level of system hierarchy. 


PROBLEMS 

7.1 Following the diagram of Figure 7*4, in which the problem was to find the 
maximum value of 

Revenue — 50* -f 75y 
while observing the restrictions 

(1J 8* -f* 4y 400 
(2) Sx + I0y^400 
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suppose additional restrictions were added as follows. 

(3) 7&x + 4 3y 400 

(4) 4 8x + 102y^4GO 

(5) 8 5x + 4 3y ~ 400 

(6) 52a-)- 9 8y 2 = 400 

(7) x + y^80 
(&) x —50 
(9) y ^ 50 

A. Make a plot, similar to that of Figure 7-4, but including all of the 
additional constraints Is the feasible area of solution substantially changed? 
B Which of the constraints could be eliminated without affecting the maxi- 
mum value of revenue 9 (That is, which could be thrown out as irrelevant?) 
C Which of the constraints could be grouped, thereby reducing their 
number, without materially affecting the solution? 

D Why is the simplification of the "system” in this way a useful process for 
the analyst? 

E A computer program used to find the point of maximum revenue (and 
the corresponding mix or combination of x and y) takes a length of time 
which is approximately proportional to the number of constraints imposed. 
Comment on the economic balance between the cost of computation re- 
quired to find the precise solution to the system containing nine constraints 
as opposed to the loss of precision in the solution that can be obtained 
mote quickly and inexpensively if the problem is simplified to only the first 
two constraints? Why would such simplification procedures be of interest 
to the analyst who confronts a large-scale problem, ic., one with a very 
large number of constraints 9 

7 2 Consider the problem defined m three variables by the following: 

Maximize Revenue = 50x + 75y -y I00i 

where 


U) 8x + 4y + s^400 

(2) 5x + lOy -y z *£ 400 


lW ° *nw*»« Pta« w a Mimciuto 

S - t i H/Tr 111 ' ' h ' ‘ e “’ ble 1» values of «, , 

*-* w i, it xrro? oT s ot i, “ teas ' Me !Y ° 

— — « « ■ 

C. Ww*, o r « aW™. »!,« „ tf* re , cm)e? 

- oWa, “"' oi *■ *** 
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7.3 The game of tick-tack-toe is played on a 3 x 3 table, or matrix, by two 
players. The first player selects and marks one of the nme squares. The 
second player selects and marks one of the remaining eight squares. The 
first player then selects and marks one of the seven remaining squares, etc 
(“Xs” and "Os” are usually used to indicate the choices of the first and 
the second player, respectively.) 

The game ends when either player succeeds in marking three squares 
so that they form a straight line, horizontally, vertically, or diagonally. The 
game may also end in a tie, in which all the squares in the table have been 
filled, but neither player has achieved the winning result 

A. If the matrix of play is numbered in the usual matrix format, so that 
each of the nine squares in the table is designated by a particular subscript 
pair (i, /) from (1, 1) through (3, 3), how many distinct patterns of play 
would be available to the first player— if the game were not terminated by 
the “win” rule above? The second player? What effect does the win rule 
as a constraint have upon the number of possibilities for each player? 

B. At the outset, the first player has nme possible square-choices, if the 
matrix notation ( i , j) is used. If we invoke the principle of symmetry as a 
simplification procedure, how many basic choices does the first player have 
on the first move? What are they? 

C. How many basic choices does the second player have on the second 
move, if the first player has chosen the center square on the first move? 

D. Why is the principle of symmetry a powerful simplification procedure 
when it can be used? 

E. Suppose the first player does not originally select the center square. How 
many basic moves are there for the second player if the first player (1) 
selects a comer square, (2) selects a peripheral midsquare? Why is the 
power of symmetry as a simplification procedure reduced in these cases? 

7.4 Suppose you are making a study of United States real estate prices with the 
objective of buymg 50,000 acres or more for residential development. As 
a start you propose to make a map of the United States that will show in 
different colors the average price per acre across the country. 

A. If your time and resources required the completion of this preliminary 
study in 30 days, how could you simplify the data collection and data 
display? 

B. How would you locate the required price information in an efficient 
and inexpensive manner? 

C. How could you simplify the storage of your basic data so they could 
be updated with new price trends? 

D. In making a more detailed study of "interesting” areas, how could you 
simplify the cross-reference problem for the many local factors that might 
be used to compare one region or tract with another? 

E. How might local factors cause you to review your original picture? 
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7.5 As Fanny HJ1 once observed, there is a certain repetition in life. Fortunately, 
Fanny was nght 

A Why is statistically stable repetition an essential requirement for statis- 
tical simplification? 

B. Would definition or simplification of a system be possible if there were 
no repetition of events? 

C W'e often hear a conflicting quotation, attributed to the pre-Socratic 
philosopher Heraclitus, who observed “You can never step in the same 
nver twice,” and “ . the only constant is change." If this world view is 
true, does it conflict with the concept of statistical regularity and stability 
required for system definition? If not, bow arc the two extremes compatible? 
D The late Bill) Rose made one of his fortunes by a statistical analysis 
of popular songs during the thirties. He found a large proportion of “ooh" 
sounds in the popular lyncs of the time and thereupon composed a number 
of songs according to that formula, such as “Barney Coogle With The 
Goo-Coo-Coogley Eyes " What assumption about the market made this 
approach reasonable, and, as it turned out, very rewarding? 


7.0 In the payoff matrix below, three strategies, S,, S 2 , S 3 , are proposed. The 
payoff for each of four possible outcomes, N lt N : , iVj, N 4 , (whichever 
occurs) 13 given Strategy Si is a dominant ttrategy, because it has a larger 
payoff under all possible conditions than S 2 or Sj. 



A T . 

N t 

N» 

N, 
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A. Why 13 the search for a dominant strategy a simplification procedure? 

danger m developing an incomplete list of possible strategies? 
V hat is the danger of developing an incomplete list of possible outcomes? 


7 7 f °“ ora ' g articles frora SrteMfc American maga- 

one. Note the data handling and simplification methods used by the authors. 

a- >*b!« may be mhnpJed and hose 
Partitioned. The leader mil benefit more 
■' 1,e h “ W - “vert a matrix, as desorbed 

W. W. Leontiel, ‘Input-Output Economy.- Scum,, lie American. Ortober 

w sirs sr “ - «■— 

W SeptoWrlW?' Stn,c, “" “ ! Development," Scientific American, 
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W. W. Leontief, “The Structure of the U.S. Economy,” Scientific American, 
April 1965. 

A. P. Carter, “The Economics of Technological Change," Scientific Ameri- 
can, April 1966. 

(A complete set of these articles in repnnt form, together with a large 
multicolor wall chart showing the complete input-output table of the 
United States Economy with latest figures is available from Director of 
Research, Scientific American, 415 Madison Avenue, New York, N. Y.) 



higher-order 

8 SIMPLIFICATION METHODS 


The detailed methods of definition and simplification discussed thus fat are 
most suitable for the study of relatively simple systems. Detailed Bow charts, 
tabulations, lists, equations, and logical rules are the paraphernalia of a 
systems analyst working to Improve a procedure or process that is both 
limited in scope and stable m cbaracter-or one abstracted sufficiently to 
produce those desirable conditions. 

Theoretically, of course, the same approach and the same tools could be 
extended without limit to the large, complex, goal-directed system- Yet, a 
number d! practical difficulties suggest an altogether different approach and 
set of tools. 

If a system is able to formulate its own goals and plans, to switch goals 
and plans at will, and to adjust internally its own allocation of resources 
and priorities of action, knowledge of current structure is less useful for pre- 
diction and control than knowledge of the guiding goals and values which 
integrate and guide die total system. Moreover, for very large systems, a 
detailed definition may not be feasible, even if it were useful. In short, the 
analyst's resources must be redeployed and his thinking shifted to a different 
level if he is to make headway in an environment of true complexity. 

As a consequence, vve now shift our level of discussion from that of 
detailed system definition and direct simplification to the higher level con- 
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sideration of system goals, objectives, values, and measures of effectiveness 
—both present and potential. 


A RESORT TO HIGHER-ORDER DEFINITION 

A statement of a systems major goals, objectives, values, and measures of 
effectiveness, together with a few major constraints, allows the analyst to ap- 
proach the “controlling structure” of the system at the outset, rather than 
after collecting detail about operations. Because the list of goals, objectives, 
and measures of effectiveness a system uses will diminish in length as we go 
up the organization hierarchy, the analyst may, by this approach, limit him- 
self in his definition to a smaller number of important relevant points, yet 
retain those factors that have pervasive and lasting effects upon system 
operation. He may, indeed, find the higher level definition a more stable 
and predictive way to define the system than the use of detail. 

The Simplifying Effects of Higher -Order Definition 

The higher-order approach to system definition simplifies by working with 
the systems control structure hierarchy. This more general, higher order 
method provides a definition less time-dependent than the detailed structure 
definition, and covers more special cases, thereby reducing the variety of 
definitions that would otherwise be required for numerous special cases. 
We give up something in specificity of prediction, but we gain in generality 
of coverage. 

The higher-order approach stresses relevance rather than completeness 
or precision. Just as the design for a building may be an elaboration of the 
outlines of an architect's sketch on a dinner menu (as was the case with 
Eero Saarinen s design for the futuristic TWA Air Terminal at New York’s 
Kennedy Airport), the analyst’s detailed system of definition may originate 
from the relevant insights gained from the higher order “sketch,” which, by 
its very simplicity, projects these points. For this reason we speak of “this 
view from the top” as a simplification method. 

The higher-order approach is also concerned with the system’s potential 
for improvement, growth, change, and optimization. In planning and 
control terms, it is a strategic approach. We want a trade between 
one form of effort and another, between one measure of effectiveness and 
another. We arrange our definitions to produce this result. 

Goals and effectiveness measures 

Before continuing, let us define goals and effectiveness measures. 

A goal is a desired state or result we hope to reach. For example, to 
hit a given target, or to come as near to it as possible, is a goal; to achieve 



202 Higher-Order Simplification Methods 

the maximum possible profit, or the minimum possible cost, mute .given 
constraints is a statement of a goal. (Other synonyms, such as ob)ectrve, 

purpose, and aim, may he used.) -_i. 

An cffcctiecness measure, sometimes called by mathematicians 

- - how well we are doing with respect 


icctive {unction," allows us to measure •— >* •* - ~ , y e 

to the goal, and in this way specifies m detail what we mean by the goa 

accomplishment lor a particular problem. 

For example, suppose our goal Is to maximize profit in a given operation. 
This is a vague idea until wc specify that profit is to be calculated as 
50x + 75y. The statement Maximize Profit, where Profit = 5 Ox + 75y, is 
now complete. We know what we want to accomplish (maximize) an 
we know that we must measure the result of any combination of x and y in 
terms of that objective. The effectiveness measure provides us with a rating 
scale and allows us to compart* alternate courses of action (say, various 
combinations o! x and y) by their relative effectiveness measures. 

One alternative {or set of alternatives) is called more effective, or 
“better" than another (or another set) if it receives a higher effectiveness 
measure score (higher profit in this maximizing problem). 1 

We see that the most effective choice of alternatives possible (or 
feasible) is the “optimum' choice. This is specifically what we mean by 
“optimum”- Not some vague notion of the “best” result, but rather the highest 
score on an effectiveness measure scale that clearly defines bow the best 
shall be measured 

The effectiveness measure thus leads us to the “best" choice or choices 


of alternatives, i e , those having the highest score or equally highest scores 
on the scale we hace defined or accepted as our yardstick. Wc can readily 
see the importance of clearly defining the effectiveness measure for a system, 
when wc realize that changing the scoring method will change the game. 

If a touchdown in football were worth less than a field goal, i.e., if we 
computed the final score by giving more points for a field goal than for 
a touchdown, the team activities would change accordingly. The team would 
attempt more field goals and fewer touchdowns. Similarly, in the maximizing 
example, if Profit = 75x + 50y (instead of 50x + 75y), we could generally 
expect a shift in the optimum combination of x and y. Moreover, the relevant 
points of system definition change, too. The constraints on field goals and 
the constraints on the production of x become relatively more important 
than the constraints on touchdowns and the constraints on the production 
ofy. 

Later in the chapter, we develop the implications of the effectiveness 
measure. Wc should note, however, that effectiveness is not necessarily the 
same as efficiency. Effectiveness means to produce the desired result, to 


* Similarly, should the objective be to minimize cost w t 
nun, mum measure Other goals and effectiveness measure 


would seek a low score, or 
axe obviously possible. 
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accomplish the correct end, or to secure the relevant outcome. Efficiency 
in its generic use means to accomplish any stated objective without wasting 
resources. Thus, we may find the most efficient route from New York to 
Memphis in terms of time, cost, or other resources, but this solution would 
not be effective (or relevant) if our objective were to go from New York 
to Los Angeles. 

To tlie economist, efficiency has a more precise meaning. For the maxi- 
mizing problem given above, a choice of x and tj would be called “technically 
efficient" if, for a stated value of x, we had made y as large as feasible or 
possible, so that our potential ability to increase tj had not been wasted. 
(Under such conditions, a change in x requires a change in y and vice 
versa.) In this technical sense, we may ha\e many “efficient" combinations 
of x and y, namely those combinations that force us to the limit of our 
limited abilities or resources. To look for technical efficiency is insufficient. 
Each efficient combination of x and y may have a different effectiveness 
measure, and only one combination (or a few) may be able to achieve the 
maximum effectiveness score.* We must have a clear effectiveness measure 
before we can select the optimum course of action. 

With these preliminaries, and hopefully with a higher motivation and 
interest in the specification of clear and workable effectiveness measures, 
we will now discuss the more complex system, consisting of a hierarchy of 
subsystems, subdivisions, or departments we hope to define by our higher 
level methods. 

The definition of conflicts 

Conflict in decision making arises when the outcome of a choice can 
be valued from two or more viewpoints. For example, suppose that in a 
deterministic system strategies Si and S 2 are proposed and that from one 
viewpoint the values of the outcome for each strategy are S, = 10 and 
S 2 = 20. Then, the highest value strategy is S 2 . However, if from another 
viewpoint the values are different, say S x = 18, S 2 = 9, the choice is 
reversed. Thus, to select the highest value strategy, the user must agree on 
the method of evaluation, and if he cannot, conflict results. 

One method of resolution is to seek a “higher” method of evaluation 
which encompasses two (or more) methods of evaluation as subsets. To 
illustrate, suppose the values shown for S, and S 2 by the above two methods 
are additive, and that we agree to abide by the total result, a new value 
which includes both. Then the new values are Sj = 28, and S z — 29 and the 

* The reader may like to compare this discussion with the problems illustrated in Figures 
7-4 and 7-9, and to follow up the point by reading the first few chapters of R P. Dorf- 
man, P. A. Samuelson, and R. M. Solow, Linear Programming and Economic Analysis, 
McGraw-Hdl, New York, 1958, and C. J- Hitch and R. N. McKean, The Economics of 
Defense in the Nuclear Age, Harvard University Press, Cambridge, Mass., 1960. 
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* r«oKtd. To ra o, ? 


suppress one scheme of evaluation 


^MOraugh simplified operations research and systems analysis studies 
oiten use one of the resolution approaches above, practical resolution^ 


conflict in most important choice problems presents serious evaluation 
difficulties. Questions of additivity, scaling, utility-and whether measure- 
ments of certain values can be made at all-are not easily resolved, huch 
questions, however, have a direct bearing upon the outcome of a choice 
procedure and thus, if incorrectly answered, can seriously distort a system 
definition and its use. 

Therefore, in the higher level of definition of a complex system, we 
seek conflicts in the goal structure of the system’s organization of goal- 
directed efforts We take this approach because system difficulties usually 
show up in the way the system rates or measures the success of its major 
subdivisions, or departments. The analyst will usually select as a guide a 
level of hierarchy detail that will reveal the major conflicts between func- 
tional departments or blocks in the work flow. 

In earlier chapters we have seen that the configuration of a systems 
transformation blocks and the transactions between them provides the 
detailed capabilities of the system, and allows us to predict system output 
for a given input. Such a definition can usually be obtained for the relatively 
simple system. 

If the system is of a higher level however, it will contain various 
choice points. The various blocks, or subsystems, may adopt alternate 
response patterns to alternate goals, either because this ability is built into 
the system design, or because the subsystems have learning or innovating 
abilities. 

Our design problem then is to coordinate the subsystems into an 
organization that will enhance the over-all purpose(s) of the total system. 
We w ant to predict the effect ol goal-directed activities of the departments 
on total system performance. 

The most likely trouble-spot in the design of a complex system, or the 
operation of an ^existing complex system, is not that the individual blocks 
do not operate efficiently" or even effectively regarding their own stated 
goals, but that the goals guiding these operations do not, when combined, 
result in either efficient or effective operation of the entire system. 

Example; Consider a typical industrial organization consisting of three 
functional departments or divisions; Sales, Production, and Finance. 

‘ "° I f 10 SoaWtroct the output of the subsystems to avoid random 
action. Let us suppose that the departments mil execute efficiently any 
goa we se e must then select an effectiveness measure for each 
department, for the use of their managers and staffs. 
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For example, suppose we decide to base a salesman’s commission on 
a fixed percentage of individual gross sales. Then the ‘"best” salesman 
would be the one with the highest gross sales, or the “best” Sales De- 
partment would be the one with the highest total gross sales. 

Similarly, we may rate the Production operation on a “production 
cost per unit” basis. Then the “best” production manager or the “best” 
Production Department would be the one that, in a given period of time, 
produced a given unit output at least cost. 

And, if we use maximum cash liquidity, or minimum bank charges, 
or minimum bad accounts, as a criterion for financial excellence then 
the financial officer, or the Finance Department, will adjust operations 
to achieve these goals, and different financial officers and departments 
may be scaled accordingly (Figure 8-la). 

In short, if a given goal or objective has been recognized and judged 
accordingly by the subsystem (or individual in the subsystem), let us 
assume that the decision-making apparatus of the subsystem (or indi- 
vidual) will select available alternatives. The chosen alternatives should 
improve the “success measure.” Moreover, the subsystems will attempt 
to alter the structure of operations, the resources available, and the 
restrictions present in order to attain “higher scores.” By design, psy- 
chological direction, or other motivations, we can usually accomplish 
this result. 

At first glance, we may be happy with our accomplishment The 
departments and individuals know their functions and purposes, they 
are performing them efficiently, and they are learning, by our measure, 
to improve their performance. 

Trouble will arise, of course, if while improving its own success or 
effectiveness measure, the goal-seeking actions of one department (or 
individual) thwart, frustrate, and defeat the goal-seeking actions of 
other departments (and individuals). 

For example, the subobjectives of maximum gross dollar sales, mini- 
mum production cost per unit, and maximum cash liquidity seldom 
produce a harmonious operating result for a system of three functions 
(Sales Department, Production Department, Finance Department) that 
hopes for maximum profitability. 

The product mix sold to achieve maximum gross sales may not be 
the most profitable, or cheapest one, to make. The cheapest product mix 
to manufacture may not be the salable one. The methods and equipment 
required for the cheapest manufacturing cost may require major capital 
investments, thereby detracting from financial liquidity. 

Worse, we may find that rating each individual solely on gross 
sales produces less total gross sales than the Sales Department could 
otherwise achieve! The lack of cooperation among individual salesmen 
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(a) 



(b) 


8-1 Two structures of systems and subsystem objectives In (a), as described in the 
text, obvious conflicts of Interest are generated between the departments by the way 
In which subgoais have been set In (b), the subgoals have been modified. The mea- 
sures of effectiveness are more complex, but attempts to maximize the rating score 
computed tram the mere complex measure will force greater cooperation between 
the departments. 


engendered by our stnct gross sales commission policy could cause 
conflicts within the Sales Department itself. 

The same argument also applies to Production and Finance. There 
is no guarantee that the same success measure applied to individuals 
and groups will produce the desired over-all result. Furthermore, there 
is no assurance that the goals set for individual departments in light of 
their functions will, when combined into a system of subgoals, produce 
the desired result for the system composed of these parts. 
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After watching the load on the three machine departments, the 
owner decides he could possibly fit another product into the line by 
using the idle capacity of the various departments, and he puts this 
plan into effect. 

Business increases, and the owner adds additional capacity to the 
three departments by buying more dnll presses, lathes, and saws, which 
in turn generate the ability to add to the width of the product line, 
which he does. As business grows, the owner finds he needs a greater 
variety of equipment; as a result, he adds new departments: a Plating 
Department, a Painting Department, an Assembly Department, a 
Shipping Department, a Credit Department, etc. 

Unfortunately, as the operation grows in size and complexity, the 
owner has difficulty finding the cost of making his different products— 
because of scheduling, machine leading, and other problems (not to 
mention the accounting problem of overhead distribution ) . He can never 
seem to pin down the responsibility for a given product or given order, 
so that costs nse, lead times extend, and misquotes are made. He begins 
to lose orders, and his staff always seems at odds over who is responsible 
for a delay. Massive inventories build up in-process between depart- 
ments. So that schedules and plans can be meaningful and not subject 
to crises each hour, the cost of communication and control systems and 
devices increases. 


i. S L 0rt> owner kk tota l system effectiveness measure 
(which be may have defined as, say, net profit before taxes per dollar 
ot gross sales) is declining. 

Moreover the owner has trouble finding the cause ol the difficulty. 
So many of the operations required for a given products production are 
entangled, enmeshed, and merged with the operations required for the 
production and coordination of other products that the problem of 
mTch ncs f S' “ a seems insuperable-even though, for Jormple, the 
”ffidenev- -T? P T ^ at 100* of capacr*. or 

he mes dm d- TT COnlr ° I of the total operation. Although 

is no/ JZSZZi St' organization 

8.2“ product-line, project, or mission basis (Figure 


a product-line W, but a"so nhvsicS'l !Uperv “ or >; responsibility along 
mem and reorganise it S a ^S “ para, \ lh<! Paction equip- 
grouping. ^ uc *-flow, rather than a functional 
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PRODUCT 1 
PRODUCT 2 
PRODUCT 3 




(b) 


8-2 A shift in organization as a manufacturing system grows in complexity. In (a), 
the small shop passes each product through common operations. When the variety 
of products is small, there is efficiency in shared operations. However, as the variety 
of products and operations grows, the costs of coordination, machine interference, 
in-process inventories and bottlenecks increase, and it is frequently more efficient 
if the operations are regrouped on a product-line basis, even though there is dupli- 
cation of productive facilities. The gain comes from the improved control ability 
and predictability obtained in the latter case. The growth history of most large 
Industrial and military organizations follows this pattern 

In the new arrangement, the efficiency of the individual productive 
facilities may not be as great as before, probably less for the “de- 
centralized” operations. Yet the over-all control cost, the inventory size, 
the lead-time length, and the cost estimate error will have been reduced, 
thus providing an increase in the total system’s effectiveness measure. 
If we consider all product lines, the over-all effectiveness of the total 
organization will have been increased. 

In the new physical and supervisory arrangements, the effectiveness 
measures of the new “blocks” in the system will also have been changed. 
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The owner would now be concerned with the profit before taxes on each 
product hue per dollar of sales, a measure that allows comparison ot 
products by their profitability. Indeed, to allow present product lines 
(and product-line divisions) to be compared to alternate ^investments, 
the owner may change all the effectiveness measures to “return on a 
five-year investment period,” which may have further reorganization 
effects upon the system structure. 

Questioning current effectiveness measures and the effects of possible 
alternate effectiveness measures, as we have seen, can have substantial 
repercussions in the operabon of the total system and its parts. 


Multiple goals 


Just as the divisions of a system may have conflicting goals that restrict 
the full attainment of a clear over-all system objective, it may also be 
possible that the system as a whole, or one of its subsystems, has conflicting 
multiple goals. Such a system (or its part in conflict) like the house 
“divided against itself,” usually finds itself in trouble. For this reason, 
multiple, yet conflicting goals are of interest in system definition. 

Note that this form of goal conflict is different from the functional or 
operating conflicts discussed above. In the previous section, for example, 
the goals of each subsystem were clearly understood and followed. Further- 
more, the total system goal was clear. Problems arose because the clearly 
followed subsystem goals caused conflict between the subsystems, and 
because the combination of such goals did not promote the clearly under- 
stood total system goal. 

In multiple goal conflict, the system or its parts may not clearly 
perceive a set of goals consistent with themselves. For example, an indus- 
trial concern may state its objective as maximizing total gross dollar sales, 
and simultaneously maximizing net profit In most large-scale industries 
that manufacture many different products and distribute to widely dis- 
persed locations, these two goals are inconsistent: The gross sales ob- 
jective argues for maximum sales without regard to profitability, and the 
prontabuity objective argues that items in the product-customer mix should 
be promoted selected),, with reference to individual profitability rather 
than to total gross sales. 


ohtSr^T'n' * e and marketing decisions, and the results 

obtamed, tell usually be different if the finn seeks one objective or the 

unde; L lTT .V°, aChk ' e bo,b staultanrUly. except 

“fhestm e l' y “ Whi0h lh0 P™ 6 ' —gin on all items 

mi Md distribution cost for all cus- 

WeoittZl ( T VO u d ” With 0ne P roJuct “Id to one customer), 
oiten find such self-«,„tradictory objectives when we have several 
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trouble symptoms. First, the system may be aware that it cannot yield 
allegiance to two disparate goals simultaneously. To resolve the problem, 
the system may attempt to follow one goal at one time and another goal 
at another time. This solution may be tolerable if the alternate goal 
choices are not grossly opposed to each other. However, oscillation between 
extremely disparate goals can reduce the system’s operation to total con- 
fusion, breakdown, or immobility, particularly if the conflict between 
goals is so severe that no goal or set of consistent goals can be selected. 
Such extreme results are common in the psychiatric literature concerning 
human breakdown, and are also evident in the operation of many large 
organizations and hardware systems. 4 * * * Clearly, planning is difficult or im- 
possible under these circumstances. 

Similarly, the system may attempt to adopt other strategies to rid 
itself of the multiple-goal problem. It may appeal to a “higher” goal, which 
subsumes the conflicting goals and permits use of a new effectiveness 
measure. It may arbitrarily restrict its list of multiple goals to a consistent 
set, or it may “lock” on such a set and refuse to consider others (even 
though conditions have changed and the old goals are no longer appro- 
priate for the growth or survival of the system). It may attempt to weight 
conflicting objectives and thereby artificially create a new single objective 
and effectiveness measure. Or, it may select and follow a simple goal or 
set of goals and leave the satisfaction of the remaining conflicting goals 
to the guidance of stated operational constraints. For example, it would 
be consistent to maximize gross dollar sales, while requiring the component 
actions to produce a profit contribution not less than 10%. This constraint 
would allow the simple objective to be followed, while modifying it to 
satisfy partially the requirements of the alternate, conflicting objective. 8 

As in the previous section, the resolution of multiple goal conflict can 
produce marked system improvement. Yet, regardless of how we resolve 
the goal conflict, our first step must be to define the goals and early 
attention to this systems analysis problem will simplify later ones. 


Constraints, risks, and commitments 

We have just seen that a constraint can modify a system objective by 
limi ting or requiring a certain level of performance. In addition to stated 


4 It is unnecessary to ate a roster of goal-conflicts that may be found in any complex 

society, and therefore in its systems. Any observer can create his own list of Hamlet- 
like dilemmas. However, as a point of discussion, the reader may consult R. S. Lynd, 
Knowledge for What?, Princeton University Press, Princeton, N.J., 1939, especially 

Chapter 3, "The Pattern of American Culture,” pp. 54-113. 

* For a more complete discussion see: the Appendix “The Simple Mathematics or 

Optimization,” in C. J. Hitch and R. N. McKean, op. cit. 
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constraints, which are introduced arbitrarily to eliminate 
problems, most systems also have physical constraints that hi 

' ‘nos, m a higher level definition it is desirable to look lor the major 
physical constraints that may grossly affect system operation. Such con- 
straints may be material- or information-processing abilities, or the time 
delays aad response-time characteristics of the system. 

Similarly, most systems have critical choice points at which major por- 
tions of the system’s resources may be committed. When such decisions to 
commit the system to action risk failure, resulting in a substantial portion 
of the systems resources being lost or ineffectively used, knowledge or 
such points and types of decision must be included in the higher level sys- 
tem definition. This knowledge is necessary because decisions involving 
large commitments may not have the same effectiveness measure as those 
for more routine decisions; i.e., the utility scale may change.* 

Example: Suppose an industrial firm has the objective of maximum 
gross sales with the constraint of not less than 10 % profit contribution 
on any sale. A salesman obtains a large order that will not only increase 
gross sales, but will also meet the profit criterion. However, that order 
represents a large percentage of the total productive capacity of the 
system. Thus, the risk of cancellation in midstream, the effect upon other 
customers due to delay in their deliveries, and possible expansions in 
productive capacity required to meet the over-all demand are all im- 
portant factors to consider in the decision to accept the new, large order. 
Once the new order is accepted, the firm is committed and the point and 
manner m which the commitment is made become important factors in 
the higher-level system definition. Here, the utility of the dollars received 
for the new order may be less than for the usual order, and the firm 
may decide to handle it differently. 


goal-conflict 
vc the same 


A Device tor Higher-Level System Definition 

One fata for summarizing the facts of a higher-level system definition 
is an nfonmtion Inventory Sheet, as shown in Figure S-3. The example 
shotra refers to the processing and distribution of frozen chicken by a 
purchasing-manufacturing-distribution system 

The outline presentation can be drawn on a large sheet of paper. At 
the top we draw a rough block dragram showing material flow and 1 informa- 

' tto V V ,; M- K. Sun. 

fflutale. won E-skuaed CM,. N.I., 1861. To 

minimum stake they Lcsrin proposed loUery, but if $100 ti the 

liut be ^ ^ *° ““P*' against than to reUt.on to the prize 
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B-3 Data collection and analysis chart. A helpful tool in deciding where to start 
inventory investigations is a complete list of decisions, risks, and problems, under 
function, on a single piece of paper. In this form, the inter-relationship of problems 
can be seen, and work can be ranked in order of importance. The example shewn 
refers to frozen chicken production, inventory, and sales. 





2)4 Higher-Oilier SmpIMM Uethvls 
U0 » Sow, A o„ ra , provided 

colunm » fen Avided fen ■*•»£* “ 3 block. (Because of 

fa *“** a toger 

sheet would permit more detail ) 


This sheet may bo completed by sihlalion . 

at each block. » ith the det.,1 oil be sheet ffloefn ta* £ o 
TvnicaUv when such a sheet has been filled out for each l omx, 
o ^ lnZnn collected (vnth all the inJomuHcm ^pl^i ^ 

Le for the supervisors to view and discuss to conference) vvdl produe 
additional entries, revisions, suggestions, and comments. 

Moreover, all the information displayed at one tune for the »upe “ 
to View and discuss provides a common ground for understanding th 
organization's objectives and problems. 

Example: In the frozen chicken example of Figure 8-3, diSerent prob- 
lems arose. There were labor troubles at the processing plant, because 
of erratic scheduling. Field inventories of the product (chicken breasts, 
legs, wings, pies, whole birds, etc.) were unreasonably high, and un- 
balanced. Although the stock was sufficient, the right item was not a 
the right place at the right time; consequently, shortages, distress sales, 
and frequent expediting were the rule. 

Because orders could often not be shipped from the mix warehouse as 
they were ordered from the field, the local sales people (at some 90 
warehouse locations) tended to board what they thought would be 
needed, to pad orders to the factory, and to estimate local trends er- 
roneously. The erratic production schedules existed because tbe mi* 
warehouse was small compared with field storage capacity and produc- 
tion had to be geared to emergency orders. 

A commitment to deliver a given pack could not be reversed, nor 
was it easy to reverse a decision to send a shipment to a given field ware' 
house location, because transshipment costs would represent more than 
the profit on the product Worse, the chicken market is highly seasonal 
placing peak loads on the buying and production operations. 

The Information Inventory Sheet consolidated these facts with the 
stated objectives and constraints of each block, or department Although 
this system definition is simplified (tbe details of processing activity 
and distribution activity are omitted), the Information Inventory Sheet 
has the advantage of displaying the “over-all” picture, so that the rela- 
tion of deblock objectives, constraints, risks, and problems can be seen 
by all 

It became apparent here, that the stated individual block objectives 
could not be achieved with a tightly coupled system. The need to make 
major production and distribution commitments early, caused largely by 


and stall 
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the small capacity of the mix warehouse, seemed to be the source of 
many of the difficulties. By enlarging this mix warehouse, which formerly 
was only a marshalling area to make up carload lots, and by changing 
its purpose to that of providing a buffer storage and speculation storage 
(rather than leaving this purpose to the field warehouses), the commit- 
ments on production runs could be scheduled without the interference 
of emergency requests. 

Moreover, the commitment resulting from the shipment of large 
blocks of production of a given pack to a given field location could be 
postponed, and shipments could more nearly meet market demand. 
(Here the grouping of variability of demand and the ability to forecast 
over a short lead time are definite advantages.) These changes in ca- 
pacity and objective served to “decouple” the production and distribu- 
tion operations from unpredictable changes in the retail market, and 
furthermore, to decouple the production operation from the live chicken 
market. When the production operation could be planned for labor 
stability and divorced from immediate shipping and sales requirements, 
more attention could be devoted to handling the raw material efficiently. 

The changes in purpose and capacity did not completely solve the 
stated problems of this organization, but they did provide major improve- 
ments in a time shorter than what would have been possible if the system 
definition had proceeded from the detailed to the general, rather than 
from the general to the specific. 

Although our information inventory example was taken from a processing 
industry, the reader can, by extension, see the usefulness of the method 
in other systems definitions problems. 


THE SIMPLIFYING EFFECT OF CHANGES 
AT A HIGHER ORDER 

Earlier in this chapter we saw that a change in system effectiveness 
measures together with planned reorganization could simplify a system 
definition (e.g., the case in which the functional organization was changed 
to a product-line organization). We now pursue that subject further. 

Suppose an investigator were assigned the task of finding the quickest 
way to control a new job in a large-scale organization. (This is the task 
confronting a new administrator unfamiliar with the details of an organiza- 
tion.) Suppose further that, after cursory examination, the organization 
itself is found to have many of the goal-conflict problems we have described, 
as well as numerous political factions, operating difficulties, wasteful prac- 
tices, and technical inadequacies. How should the investigator begin? What 
questions should he ask? How could he cut through oceans of data without 
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loring perspective? If he is to supervise, how does he obtain control of 

hlS For this ossipunenb 'onr investigator will certainly strut with *= Wssb«- 
level approach. He will use his knowledge of the conflicting de P 
objectives, the critical decision and commitment points, and the procedures 
by which the organization exercises control. . . . 

Although he may be expected to find only a handful of powerful mdi 
viduals in even the largest organizations, and also to realize that his m 
coercion sources lie in the power of the budget and the power of appoint- 
ment, he may choose at first to shun these individuals or devices and view 
the organization at “arms length.” . , 

So that he may not be confused by too close a view, or be swayed y 
the briefings or political charm of the organization s present department 
heads, or waste time in a continuous series of conferences, he may prefer 
to formulate and to ash a series of pertinent questions in writing, and to 
delegate the responsibility for answering them to particular individuals. 


with particular deadlines. 

Our investigator will receive some answers, but not others. Sub- 
sequently, he will spot deficiencies in the organization and locate conflicts 
of interest, duplication, omission, and critical commitments. (He may cor- 
relate these facts in an Information Inventory Sheet, or other convenient 
means, such as marginal cross referencing on the reports received.) One 
of his more important questions will be to request the evaluation reports 
by which the organization judges its performance (the documents and 
methods that implement measures of effectiveness in practice). Frequently 
he will find these reports of performance are contradictory, misdirected, 
unused, or lacking altogether. 

lie may then proceed to define his system by a synthesis of this material 
(if it is satisfactory), or he may choose to simplify the system by manipu- 
lating its objectives, effectiveness measures, and methods of reporting 
performance. 


Note that to control the system our investigator may of necessity 
simplify his definition of the system or the system itself. It should also 
be evident that usually he cannot accomplish these simplifications by a de- 
tailed specification of changes in operating practices, detailed instructions 
to each individual in the organization, or detailed intervention in each 
exceptional case, nor can his present definition include these facts. He can, 
however, simplify system structure, operating practices, and decision- 
making, if he restructures the system's goal hierarchy and monitors per- 
formance against the new goals. 

This high-level change may involve reorganizing the budget, changing 
the objectives of key personnel (or replacing them with others who will 
operate under the new objectives), and changing the format of reporting 
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procedures and the communication flow. In other words, the investigator 
may combine system treatment or improvement with the definition of his 
system, bypassing the morass of detail he cannot control directly. 

Although we have only sketched this approach to higher-order simpli- 
fication through redefinition or manipulation of the system, we now present 
two dramatic examples that illustrate our point more fully. 

Example: In 1981, four weeks after he became Secretary of Defense, 
R. S. McNamara dispatched drafts of questions, and sought not answers, 
but suggested information sources and possible deadlines for obtaining 
answers. On March 1, 1961, he issued the formalized drafts of his famous 
96 questions (known as "McNamara’s 96 Trombones”) with a deadline 
and a man assigned to answer each question. 1 The answers, often re- 
turned to the author with requests for more specific information, pin- 
pointed the duplication effort (the Pentagon had 3000 committees), 
contradictory estimates and plans (the separate intelligence services of 
the Army, Navy, and Air Force had different measures of the missile 
gap and peril), and omissions in organization (the Defense Department 
had no division charged with examining the organization and procedures 
of the armed services, although that was its task; instead, each service 
had its own department supposedly performing that function internally). 

First, the obvious duplication was immediately eliminated (e.g., more 
than 400 committees were abolished in the first six months), and the 
missing divisions were added ( the Defense Intelligence Agency and the 
Office of Organizational Management were created). McNamara then 
reorganized the budget and comptrollership operations and set the ef- 
fectiveness measures by which the organizational complex was to 
evaluate itself (and be evaluated at the higher level). In vindication of 
General Taylor’s position, these measures were adjusted to a mission, 
rather than to a functional or service basis, with the over-all objective 
of increasing the flexibility of the services instead of staking everything 
on the concept of massive retaliation, which had been fostered in the 
past, mostly by the Air Force. The widely reported changes m the 
Defense Establishment are an outgrowth of these early actions, which 
for the first time made the system manageable by a civilian authority, 
and did so, not by changing the practices of the military direedy, but 
by changing the measures and questions by which their sources would 
be evaluated. 

*T. H. White, “Revolution in the Pentagon," Look, April 23, 1963, pp. 31-44. This 
extremely complete and well-wntten article gives the background and procedure of 
McNamara’s reorganization and results in more detail. For a later example at the city 
level, see S. V. Roberts, “Mayor to Set up Top Planning Unit,” New York Times, 
July 27, 1966, p. 1. This article refers to the application of Defense Department 
Methods in New York Gty. 
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Higher-Order Simplification Methods 

Example. O. W. WJson, Police Supcrioteoto. of Otago ™ 

(and, like McNamara, a former ] ]oAouts for criminals 

a major police scandal in wine p „ found his force riddled 

SSSSSHsE^sSS 

panized along the boundanes of a political ward, with th . 

dictating justice, some patrolmen had to purchase their o 

PaP ln leorgSiSigX^orce, Wilson, like McNamara, gained wmbol by 
working a? arm’s length. Although he occasionally talked toAePt 
to create public confidence in his activities, he usually avoided the public 
and political visitors, and simply asked questions of his , 

He found, for example, that no organized reporting method existed 
for evaluating the performance of officers and patrolmen. Crime r ®P or 
were often lost or not recorded, and seldom evaluated if available. More- 
over, no assignment priority existed: Noisy parties received the same 
attention as murders in progress. 

Wilson defined his system by creating the reports and communicatio 
practices required to give the men practical effectiveness measures. 

All crime reports by citizens were recorded, coded for computer 
storage, and dispatched on a priority basis from a centralized control 
center via a 12-channel communications network to radio cars. In report- 
ing the disposition of the case, the patrolman could no longer radio in 
that he had resolved the problem; he had to complete and file a written 
report which was matched against the original report in the computer. 
This action discouraged laxity, bribe-taking, and preferential treatment 
of offenders. It also allowed statistical analysis of crime rates, locations, 
and methods of criminal operation. 

Wilson drastically Unproved the operation of the force by a combina- 
tion of measures involving redefinition of the system and the introduction 
of new communications equipment, more radio cars, and adequate sup- 
plies. However, the most powerful force in this improvement was in the 
redefinition of the system, by providing it with effectiveness measures 
that allowed Wilson to control the system. Dot be a subject to its chaos, 
political rivalries, and criminal intrigues.* 


THE PROBLEM OF PRESENT AND POTENTIAL CAPABILITY 


The higher order approach to system definition simplification is often 
directed to an evaluation of alternate possible operations under changed 


N umcrous author* have reported on the Chicago Pohce reorganization. The most recent: 
J. Starr. Chicago Shovv« Way to Police Reform,” Look, October 19. 1965, pp. 43-19. 
Sec alio, O. W Wilson, Police Administration, 2nd ed., McCraw-IljU, New York, 1963. 
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objectives, effectiveness measures, and constraints. For complex systems, 
this form of long-range definition demands the analyst’s highest skills, and 
usually requires a multidiscipline approach for satisfactory results. 

Thus, we may ask for a definition, not of present system operation, 
but of potential system operation. For example, the potential actions of 
a complex system are of prime concern to a military or industrial intelligence 
officer, who must anticipate future opponent actions . 9 He seeks to simplify 
his problem by ruling out opponent actions unfeasible or unlikely owing 
to constraints in the opponents system. 

Such constraints may be physical constraints (the available equipment), 
environmental constraints (prevailing terrain, weather, raw materials), in- 
formational constraints (available technical talent, educational resources, 
communication networks), and goal and value constraints (the rigidity of 
goal and priority selection, which limits possible goal and value choices in 
the system). 

Example: A system has a wider range of possible actions at its disposal 
and can take them faster if it has a large number of uncommitted 
resources. A shop with substantial idle capacity on a variety of produc- 
tive machine tools can take on a variety of new production orders 
quickly; but, a shop with a small variety of totally occupied equipment 
cannot easily do so without a time lag for the acquisition, installation, 
and planning required for new facilities. 

In addition, a system is likely to make a more precise selection of 
appropriate actions and to implement them more swiftly if it devotes a 
substantial portion of its available resources to long-range activities. The 
supervisory control program of a modem computer installation physically 
occupies much of the high-speed memory of the machine, as opposed to the 
area required for the execution of a current calculation. However, the 
memory resources devoted to this "long-range” control permit the machine 
to select appropriate subroutines rapidly, and to execute a sequence of 
different calculations rapidly, thereby increasing the system’s effectiveness 
for a wide job mix. Similarly, the military intelligence officer unconcerned 
with the anticipation of possible changes in enemy organization and goal 
choices and preparation for them is unlikely to perceive the many alternate 
courses of enemy action, or to have resources available to cope with them. 

The objective of this form of definition, of course, is the exploitation 
of long-range enemy weaknesses, or the strengthening of long-range weak- 
nesses in the friendly system. 

Example: On the national level, industrial, input-output tables that 
show by transfers of productive output how industrial segments of the 

* See the interesting discussion of international intelligence system organization and opera- 
tion in S. Kent, Strategic Intelligence for American World Policy, Princeton University 
Press, Princeton, NJ., 1949. For a discussion of industrial intelligence operations, see 
W. Fair, “The Corporate C.I A ," Management Science, July, I960, pp. B489-B503. 
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economy are related, both totally and geographically, are comrnordy 
prepared. 10 Thus, a certain portion of the steel industry s output (which 
is relatively stable under normal conditions) is consumed by the steel 
industry itself, some by the automotive industry, some by the electrical 
industry, and so on. A change in the total output will then pervasively 
affect the entire economy, because of the needed output transfers. 


We may then ask from such a presentation: (1) "What are the critical 
industrial segments of our own economy that if improved would have the 
greatest effect on total production?" and (2) “What are the critical seg- 
ments of the enemy economy that if destroyed would have the most 
damaging effect?* The military implications of such questions are obvious 
when we think of the selection of bombing targets, and it was this form of 
analysis that selected the German ball-bearing industry for destruction in 
World War II. From a domestic viewpoint, we may want to rephrase the 
questions geographically. For example, “What would happen to the capa- 
bility of the United States economy if an atomic bomb were dropped on 
Cleveland, Ohio?" Most nations engage in such long-range capability 
analyses of their own and foreign countries, but the same approach is 
clearly appropriate for anticipating the effects of casualty losses, strikes, or 
improvements at the firm level. 

In the assessment of long-range growth, we are interested in the 
rales with which constraints and goals of a system can change, as well 
as their direction of change. Such questions, of course, also involve the 
speed with which a system can reorganize its present resources for alternate 
application in response to changed goals. For example, from a military 
viewpoint, "How quickly can the typewriter industry shift its productive 
output to machine guns?” From an industrial viewpoint, “How quickly 
can a competitor change his present product line?” 

It is important to realize that the analyst can define a system by its goal 
and constraint structure, and measure quantitatively or estimate quantita- 
tively the amount of surplus resources available for a given system. More- 
over, it is usually possible to measure or estimate the rate with which change 
in these resources and then use could be effected. 


Example: A small shop provides the easiest example. At any time, given 
the number of shrfts ssorltcd, the number of avatlable hours by machine 
type is known, or con be estimated within tunow limits (allowing for 
breakdown, tool setups and changcovcrs, and the like). Similarly, it 
is possible to find both the average and the instantaneous loads on each 
_ <ype of «l“P">cnt, so that a slack figure giving smplus hours available 

Jtatrmmt Scwn«, 

The Sinai./. e/rh. Omened r-°* ' ^ definition and anil>us. V, LeonUftI, 

the W mo„i,ph^Sr, p ^S“ V - ° rf0 " 1 U “ V "“W York, 1931, « 
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for alternate use may be computed. Next, the speed with which the 
equipment can be alternately used can be estimated from planning and 
tooling lead times, the mix of equipment, and the skill of management 
coordination. Finally, it is also possible to estimate the time necessary 
to supplement existing machine resources by acquisition of added equip- 
ment, overtime work, subcontracting, and similar alternatives. With such 
quantitative estimates available, evaluation of alternate production pos- 
sibilities in response to alternate production goals may proceed on a 
quantitative basis, too. In this way, alternate production possibilities can 
be ranked by an effectiveness measure and alternate production facilities 
(different firms, say) can be ranked by their effectiveness for a given 
task. This type of definition and analysis has both industrial and military 
significance, and many such studies are performed in planning large 
purchases, or selecting military contractors. 11 

The steps in a capability study 

In developing a long-range capability study, three steps are involved: 
description, report, and speculation. 

Development of present constraints will show that many are permanent. 
For example, the terrain, roadway, rail, communication, and basic raw 
material constraints of a nation, as well as the moral and social character- 
istics of the population, population clusters, educational levels, and 
demographic characteristics do not change rapidly. In industry, the physical 
locations of the plants, warehouses, and major tools do not change on a day- 
to-day basis, nor do the basic skills of the employees, the products, or the 
characteristics of the workers. The development of this background infor- 
mation may be called the descriptive element of the long-range definition.' 2 

Some constraints, however, change with time: leaders, current tech- 
nology, and available technical talent, exotic tools, and equipment. As a 
result, basic description must be updated constantly if the analyst is to 
make revised projections of capability. This is the reportorial element of the 
long-range definition. 

Finally, projections must be made from both past and current informa- 
tion so that long-range potential and the effectiveness of alternate system 
actions may be evaluated, or so that the probability of alternate opponent 
actions may be estimated. This is the speculative element in the definition 
process. 

’‘Similar capability measures can be made on social systems, using the skills of the 
political scientist, sociologist, and anthropologist. See K. W Deutsch, Political Com- 
munity at the International Level, Doubleday Short Studies in Political Science 
SSPS-1, Doubleday, Garden City, N. Y., 1954, wbo (in Chapter 4) lists 14 selected 
measures for defining the characteristics of nations and communities. 

” The terms used here are those of S. Kent, op. cit., who provides extensive examples of 
each phase of the definition. 
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The suggested steps are equally appropriate for the long-range definition 
of the industrial organization, the political sectors of a country (municipal, 
county, and state), and the long-range improvement of hardware systems 
by redefinition and redesign. 

It may appear initially that the projection of system capability and 
action does not simplify the analyst’s burden, but rather creates data that 
further complicate the analyst’s job. The next section shows that this is not so. 


THE DATABANK 


In application, the execution of the descriptive step often results in the 
creation of massive dossiers, files, and other collections of documents. (The 
New York Times, for example, m an attempt to anticipate the background 
needs of breaking news, maintains bibliographical dossiers on 1-3 million 
individuals, 500,000 corporations, 100,000 places, 100,000 ships, and 10,000 
maps. 1 * ) With modem electronic computer equipment much of the descrip- 
ti\e step may be reduced to a databank or master file of information 
organized so that information may be retrieved quickly and inexpensively 
in response to selected questions by the analyst 

For example, in the industrial application, the master file may contain 
an inventory of all the products made, keyed by part number, together 
with such identifying factors as price, inventory level, orders outstanding, 
manufacturing specifications, mariettas data, competitive products, alter- 
nate choices of materials or vendors, alternate manufacturing practices, 
wid other selected information, with cross-references to the background 
fifes less frequently consulted. 


In the municipal file, data may be keyed to the street address of land 
parcels, so ‘that tar rosters, enme statistics, street repair statistics, municipal 
lighting statistics, and similar information may be readily available to 
“S“" ° n S,'“ , S e questions, such as ‘What will be the effects on the tar 
m f m CduCat “ ,nal requirements, utility capacity, and eco- 
SmSS? ?■ “ 3 “* kr Sartre aparimen, is built at 

_ SiDa! 10e0 - the e'eetxonic compute, has made the mechanization of 
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lie" spa per. moucied alter the organization of a large 
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the databank both technically and economically feasible for wide use. 
Because the computer has improved the density of information storage 
as well as the speeds of updating and retrieval by orders of magnitude, the 
technical feasibility of handling large files of data that may be searched 
in complex extraction combinations is available at low cost today. 15 More- 
over, much of the databank information can be used for servicing routine 
applications, such as inventory status reports, customer billing, short-range 
production planning, and development of market statistics— applications 
which justify the creation of a basic databank that when supplemented can 
be used for long-range planning by the industrial firm. The direct benefits 
in government, such as tax billing and control, inventory and asset control, 
and work scheduling can also justify the creation of databanks. 

In addition, the recent development of electronic remote input data-sets 
(such as those provided to the airline reservation clerk or savings bank 
teller for “on-line, real-time” updating of seat and dollar inventories) to- 
gether with their associated communication systems further raises the 
ability of “management information systems” to provide current updating 
of master files, a previous stumbling block in accurate file maintenance and 
control using the databank. 

Thus, it is both technically possible and economically feasible today to 
maintain large-scale inventories of information, which can be updated 
with current information. The implications of this ability should not be 
overlooked by the analyst who seeks to make long-range projections, be- 
cause the availablity of this resource greatly simplifies his job. 

Historically, for example, the time and cost spent on systems analysis, 
particularly for long-range projections and evaluation of alternatives, was 
devoted in an 80/20 ratio or more to data collection and “purification” 
as opposed to analysis and projection. The application of most of the 
techniques of statistics and operations research also bore the burden of 
onerous data collection. With the steps of description and reporting reduced 
to a mechanized format, this ratio is readily reversed. 

Thus, recent technical and economic improvements in data collection, 
storage, and retrieval permit the analyst to concentrate on question genera- 
tion, analysis, and speculation. For the large-scale system, particularly, the 
control advantage of this new asset is overwhelmingly important for the 
analyst. Indeed, many forms of analysis, such as statistical analysis of 
“real-time” data inputs, would be unfeasible without these developments. 

How capability problems and intelligence questions arise 

Future capability problems and substantive intelligence questions 
usually arise in one of three ways: (1) As the result of reflection by men 

“ Marvin Camrus, “Information Storage Density," IEEE Spectrum, July, 1965, pp. 93- 
105. This article is of additional interest, because effectiveness measures for various 
forms of storage media are developed in it. 
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the system analyst’s product. These routes may combine to generate qu 

^For example, questions may arise from the “desk- of a» 
officer assigned a geographical area or an industrial product g p, 
he surveys both the background information and incoming information m 
his area A surprising fact, or a contradiction between mcomlng data 
and background history, may prompt a question. The introduction of a ne v 
product, the plans for new facilities or equipment, or needs of an executive 
may initiate questions, as was the case with McNamara and Wuson. 

The following examples illustrate some of these sources. They have been 
arranged to illustrate questions regarding constraints, then questions con- 
cerning goals and goal response. 

First, an inventory of assets presently available to a system frequently 
indicates that the relative value of the asset mix has changed with time, 
often without appropriate utilization changes. 

Example. As the result of a proposed computer installation on a "total 
system" basis, a large oil company made a list of its assets, such as 
producing facilities, real estate, station leases, chemical equipment, 
crude reserves, financial resources, credit card lists, present computer 
equipment, and similar items. Although the firm was formerly production 
oriented, and adjusted its objectives to the maximum utilization of pro- 
ductive potential, it soon became apparent that its 6-milhon credit card 
holders were an even more important asset than the Tofining facilities. 
The firm changed its effectiveness measure from a production to a market- 
ing orientation and thus focused attention on the exploitation of the 
credit card holders. Items other than gas and oil were added to the 
product mix because they had greater profit margins: Consumer items, 
insurance, and motel rooms, were made available on credit, both at 
stations and through the gas and oil bills mailed monthly. The change 
in objectives produced by an analysis of asset potential markedly im- 
proved the profit and loss statement of the firm. 1 * 

Similarly, an analysis of capabilities may reveal that some assets are, 
m fact, liabilities. 

Example: When Billy Prince became President of Armour, the firm's 
objectiv e was to be the largest meat packer in the United States by sales 
volume.” An inventory of the firm's assets, by their profitability, showed 

" ° l VWl tyPCl ** T ‘ Uvut > lnf wwlioru in Marketing. McCraw-HiU, 

b - y c W ? tovide a before and after companion. See 
«d R Ubar SlW Fortune, January 1938, p. 120. 

and R. Lubar, Armour See* Fat Yea,* Ahead/* Fortune. October 1939. p. 117. 
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and other Air Force operations, many commercial aircraft corners, 
as TWA, display current status of performance, equipment failures, a 
current operating difficulties in a central briefing room, connected y 
private line telephone to the major national airport installations. A oaiiy 
survey of this information, presented to operating executives in verbal 
graphical, and statistical form, stimulates questions that may be ansvvere 
by the telephone hookup, or by special investigation. By using this ap- 
proach, an epidemic of part failures for example, may be detected by 
statistical observation and action taken to prevent serious loss. Indicators 
of long-range difficulty may also be derived from all parts of the system 
by using the communication facilities and the summary properties or 
the briefing room display. (The radar-room scenes of television and the 
films are familiar examples of the briefing room approach in combat 
situations. The electronic computer, with its databank of information 
and electronic input devices, permits more sophisticated use of the 
briefing room approach to question stimulation. But, frequently, in com- 
plex situations, the ability of the human brain to correlate and synthesize 
facts comes to its greatest usefulness in developing questions. Man and 
the computer cooperate in this situation to produce results better than 
cither could produce alone.) 


Let us turn now to a few examples that refer more to the political 
scientist, sociologist, anthropologist, and psychologist than to the engineer, 
mathematician, or accountant. 

In addition to the potential capability that could be exploited, which 
we have just discussed, wc must often determine if that potential mill be 
exploited by an opponent’s ot ones own system. We are all aware of the 
individual with great talent, who is insufficiently motivated and therefore 
docs not use »t. and, conversely, of the individual with more limited 
resources who uses all of them to the limit of his abilities. In fact, the 
latter individual may be more productive than the former But if the 
former could be goal directed to exploit his greater assets, his potential 
capabilities would produce more results than the capabilities of his limited 
friend. Furthermore, both individuals and organizations have the ability 
to grow and Icam, so that we may also ask to what degree a possible 
change in goals and values would cause a change in the potential capabilities 
of the system. The following examples, for these reasons, have a direct 
bearing on studies of potential. 

The disparity between a present level ol achievement (using a given 
effectiveness measure) and the letcl perceived as an "optimum* or satis- 
factory objective may, rf known, bo used to assess the pressures causing 
diangc in the system performance. 

The so-called reference group of sociology is based on the idea that an 
analyst can measure the "relative deprivation" perceived by an individual 
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or a group between a present state and the one held as a goal. When 
this measurement can be accomplished, the analyst can gain some long- 
range insight into trends of opinion and action. We would call this form 
of definition a goal capability study. 


Example: During World War II, morale studies were made of the 
troops. 20 Analysts expected Negro troops in the South to have a lower 
morale than Negro troops in the North; yet upon investigation they 
found the contrary result. The Negro recruit related his current Army 
status and pay to that of the local Negro community, not to an absolute 
scale. Army Negroes in the South had more available funds to spend 
than the local males therefore they were able to afford more luxuries 
and attract the local Negro women. The lot of the Army Negro in the 
North was worse, by comparison to the industrially employed Negro 
male. In the North, the industrially employed Negro enjoyed the ad- 
vantage of relative luxury and female admiration. The relative depriva- 
tion of the Northern Army Negro lowered his morale. 

To extend the example, the human being, group, or organization can 
be expected to have pressures for change toward a goal if ( 1 ) the goa 
is within the potential capability of the system’s resources, and (2) it 
the goal has not yet been reached. The greater the disparity between 
achievement and objective, when resources are available, the greater 
the pressures to change. 


Example: Tie concept of the "reference group' has been widely 
exploited a, a motivator of change in the marketing of consumer products. 
For example, if a salesman can locate the individual or fanuly con- 
sidered a reference group (“The Joneses ) in a g IVL 'n commum , e 
may attractively induce a sale to this "thought leader and exploit the 
social pressures so generated. Aerial time- apse p o ograp 
conditioner installations show remarkable epidemics hnuse with 

from the motivating air-conditioner installed in the comer hous l 
children, or in the prestige apartment inhabited by a professional 

worker. 21 


Conversely, for systems in which the disparity 
ment and the ultimate objective is great, the ana ys insufficient to 

in goal patterns, values, and ^ 

£5^2 or 'cannot relieve the constraints 


, p,„j,w.v>n /VoL IV in Studies in Social Psy- 
•s. A. Stauffer, et al„ Measurement and Princeton, N. J, 1950. 

chology nr World War II), November, 1954, p. 140. 

‘ W. H. Whyte, Jr., "Tie Web of Word of Moutu, 
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ihal limit its operation, the analyst can taped revolt, system breahdosvn, 
or erratic patent behavior. The dealings of Louts XVI with the various 
groups and estates of the French people, which resulted in the French 
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electrical motor severely o' 
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m systems of individuals often causes them to duster, in terms of geographi- 
cal locations, in interaction patterns, or in unified actions. This fact is 
also strategically significant in long-range planning. 

Example: To illustrate, consider the following problem, a prototype of 
the political action study which is seldom published in detail for 
obvious reasons. 

The conflict of special interest clusters often thwarts public projects, 
such as construction of new schools. In small New England Communities, 
for example, dramatic conflict often arises between two highly clustered 
centers of value and power. 

On the one hand, we have a familiar cluster represented by the 
old-line Republican, country club member, who is usually Protestant, 
a relatively wealthy land and business owner, conservative in attitude, 
a long-time, well-educated resident who associates with similar types. 

On the other hand, we have another familiar value cluster, the 
Democratic, Catholic, liberal, labor-class, relatively poor member of 
the Elks, Moose, Eagles, Legion, or other fraternities, often with rela- 
tively less education, without extensive property or business holdings 
who also associates with similar residents. 

Although the former group usually holds the economic power of 
the community, the latter holds the political power, being more numer- 
ous. Thus, a proposal on public issues by one side— such as the con- 
struction of a new public school, or the bussing of parochial students 
at public expense— is often defeated, on principle, by the other side. 

The size and extent of each power group can be defined by the 
methods described in Appendix E, “Suggestions for a Study of Your 
Hometown,” or by the prototype and stratification techniques dis- 
cussed in Chapter 7. More important in this example, we can often 
isolate individuals or small groups with critical power positions by 
graphing or sorting their associations, transactions, memberships, and 
other possible relationships. Joint memberships in given organizations, 
joint transactions in daily life, and joint opinions on key issues can 
thus be displayed in tabular or flow graph form. The individual or group 
with any compatible transactions to and from each major cluster often 
has the power to move both. 

For example, in one New England community, which must remain 
nameless, a group of investigators isolated a key individual. She 
was a wealthy Irish manufacturers widow, who belonged to the country 
club, made frequent trips to the bank, but who was also Catholic, 
Democratic and who, from her husbands background, enjoyed the 
confidence of labor leaders, minority groups, and dissident thought 
leaders in the community. She was not active in politics at the public 
level, and her power was not well known. She hardly recognized it herself. 
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Yet, her quiet recommendations to both sides were often sufficient to 
stimulate compromise and cooperative effort/* , . _ 

Thus, in this example, the mechanical study of interest dusters, 
mutual associations, and transactions helped redefine the community 
power structure. Long range questions of community capability to 
achieve given goals, and the methods for achieving them can often 
be clarified by the approach described. 


THE CIRCULAR PROCESS OF DEFINITION, 

TEST, AND TREATMENT 

The foregoing examples should show that long-range capability studies 
are higjvet-otder forms of system simplification. A definition that works 
with goals and constraints allows us to concentrate selectively on system 
features that are now relevant and on features that will probably be 
relevant in the future. 

We have also seen that the definition of a large-scale system will change 
with time, certainly in its detailed description. The higher level approach 
may thus provide a more stable overall system definition, as well as a 
simpler one, and may furthermore guide the analyst in his prediction of 
future system change and growth-thereby simplifying the analysts future 
problems. 

We now come to our final topic for this chapter and this section. Just 
as the system to be defined can change with time, it is also true that the 
analyst will later alter his system definition with time, seeking constantly 
to improve it. 

In short, the Job of system definition never ends for the analyst of the 
complex system. He will make a preliminary definition, test it, perhaps 
make a system change, treatment, or improvement Then, from his experi- 


“The success of the Kennedy family in politics illustrates more dramatically the prin- 
ciples developed in this example For another illustration, the reader should consult 
K. \V. Dcutsch, Nationalism and Social Communication, MIT Press and Wiley, New 
York, 1953. especially Figure 1, p. 18, which graphs the growth of a national elite 
in the United States. 175(M795, according to the late Robert K. Lamb (also the 
author or Appendix E) In this diagram, the political power of Alexander Hamilton 
during the stated period stands out dearly. He was military secretary to George Wash- 
ington. protege of Elia* Boudinot, Director of the Mint, and William Livingston, Gov- 
CTnor of New Jersey, was related by marriage to John Van Rensselear and Philip 
5chu>lcr. and as Secretary of the Treasury was associated with Wdham Ducr, and a 
wide range of other prominent individuals, and institutions, such a* the bank of New 
I. v , perv V 1 T; ', y business ties to the colonies of Virginia, Pennsylvania, 

New York, and Massachusetts case Hamilton key policy, decision-making, and per- 
suasive powers n the early development of the United States. Many of the key power 
cluster* shown fn the referenced diagram persist to this day as the result of continued 
tS CowJ I p^^ nal ' assodatk)n * "■‘dun the power clusters defined in 
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H The production utanoget marts that the maximum number o! units be 
manufactured under die original constraint!, i e, that the problem sWd 
be to make x + y maxmium. What lesult mould then be obtained in terms 
of x and (/? What is the maximum number of units that can be produced!* 


j. Under the original conditions in A, an additional hour of capacity on 
either Machine 1 or Machine 11 is possible. Which should be preferred in 
terms of the maximum revenue objective!* Would this preference change if 
the objective were changed? 


J. Having found the solution to I, how many added hours of machine 
capacity, of the type selected, could be added before the restriction set 
by the other machine or market restriction is encountered? 

K. In A, suppose die income from Product B were fixed at $75 per unit, 
but that the income from Product A could be increased by increasing the 
pace per unit from the present $50 How large would this price have to be 
before the onginal values of x and y, obtained in B, change to more produc- 
tion of Product A? 


L We have now considered the objectives relating (a) maximum income, 
(b) minimum cost, (c) maximum profit, and (d) maximum number of 
items produced Can you think of any other objectives that might be 
proposed’ From this list of possible objectives, how would you select the 
one to use? Why is the potential of unused capacity of interest to the 
planner, whatever the objective selected? Why is it also of interest to know 
where capacity is no longer available? Why is it of benefit to know the 
marginal increase in a given objective that can be obtained by providing 
additional capacities of various kinds? Why are these questions, which con- 
sider only the objective, function, and constraints imposed by the system, 
highly useful for planning-even though we have not described in any detail 
the exact operations of the machines? Can you think of any other situations, 
other than the shop illustration provided here, in which the same form of 
thinking would be valuable’ 


8 2 In developing a new management information system, there are two general 
ways to proceed In the first, all presently used input paperwork forms, 
files, and output reports are examined. Then the new system is devised to 
consolidate the files and to simplify the reports so that an integrated oper- 
ating system results. On the other hand, it is also possible to proceed by Gist 
making an analysis of all the decisions and operations that require informa- 
tion, then by designing a system to satisfy these needs. Comment on the 
disadvantages of each of these approaches. 

8.3 From time to time the Federal Reserve Board changes the prime interest 
rate used by the banking system of the United States. 

A. Why is this a higher lev-el form of system control? 

Variable? S,n,ctura ^ c,,an S« could be expected from a change in this control 



C. How do system lags and anticipations enter into the control picture 
when such higher level control variables are used to change the system of 
commercial transactions? 


Again, consider the tick-tack-toe game described in Problem 7.2. Suppose 
the first player selects the middle square as his first choice. Suppose also that 
the second player selects a comer as his following move. A strategy that will 
permit the second player to guarantee that the game will end in a tie (so 
the first player cannot win) exists. 

A. What is the tying strategy for the second player? (Note that this 
may be found by a check of the appropriate strategy to use for each possible 
basic move of player one on his second choice.) 

B. Write your procedure in learning the tie strategy for the second player. 

C. Could you devise a machine to carry out the strategy you have learned? 

D. Could you devise a machine that would "learn” what you have learned 
about the tie strategy for the second player? (As a start, assume a machine 
without knowledge of the game— which corresponds to a player who has 
not played the game before. Assume also that random moves will be made 
to any feasible square at each stage and improve this strategy depending 
upon the win, loss, or tie results for player two.) 

E. In what way does "learning” impose constraint upon action for the 
second player in this game? 

F. Why does learning generally impose constraint upon action in highly 
structured situations of the type illustrated by tick-tack-toe? Is such con- 
straint a general result of the learning process? 

Frank is hurrying home late, after a particularly grueling day, when it 
pops into his mind that today is Kitty’s birthday! Or, is it? Everyone is 
closed except the florist’s. If it is not her birthday and he brings no gift, 
the situation will be neutral, i.e., no payoff, or 0. If it is not and he comes 
bursting in with roses, and obviously confused, he may be subjected to the 
Martini test, but he will emerge with a position of strong one-upness-which 
is worth 1. If it is her birthday and he has clearly remembered it, that is 
worth something more, say 1.5. If he has forgotten it, he is down like a 
stone, say —10. 

A. What should Frank’s strategy be if he assumes that the outcomes (birth- 
day or not) are equally likely? 

B. How could Frank's decision be changed if his probability estimate is off? 

C. If his estimates of “payoff” are not correct? 

D. If he refuses to accept the outcome —10, without regard to the probability 
of its occurrence? 


SYSTEMS ANALYSIS AND DIAGNOSIS 
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SYSTEMS ANALYSIS: 
A SURVEY 


From what we have seen, the definition of a system will be a circular pro- 
cess— with definition, analysis, and treatment following sequentially until 
the objective of the system definition has been reached. 


THE PURPOSES OF SYSTEMS ANALYSIS 

In developing the system definition we therefore reach a stage at which 
the investigator introduces tests and diagnostic procedures designed to 
assure him that his provisional definition is reasonably good, or definitely 
bad, and to note specific strengths and weaknesses in the system. This 
stage in the circular process is called systems analysis. 


SIX QUESTIONS 

Of course, we have many specific reasons for testing a system or what we 
know about it, but six general reasons may be advanced, and it is useful 
to review these to indicate the range of test types that may be needed. (The 
range of test types needed by the investigator, following our previous dis- 
cussion, determines the complexity of the system definition.) 


237 
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Curiosity— What Is It? 

In the previous chapters we have assumed that something is known about 
a system we want defined specifically and selectively for a given purpose. 
However, this is not always so. Suppose we are presented with a system, 
which may be thought of as a black box with various input and output 
connections, various knobs and dials, and various bumps and corners' - 
and that we have no idea what the thing is, or what it is for. Indeed, tbe 
“system" need not be mechanical or electrical. Suppose we encounter 
some strange item on a beach and wonder what it is and what it does. 
Or suppose we walk into a strange social organization and wonder what 
goes on there. How shall we answer such questions? 

The solution, of course, is to test it. Have we seen anything like it 
before? What happens when we poke it, or turn it over? Does it move or 
does it he still? Can we distinguish the inputs from the outputs? Does it 
have any observable structure, or any regular operation pattern that can 
be seen 9 Does it respond to a few random tests? 1 Does it respond as we 
predicted it would when we subjected it to a specific test? If so, some of 
our curiosity is removed, and we may proceed with our definition, im- 
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proving the predictability of the system model in response to a wider 
range of tests. 

How does It operate? 

As we continue our tests, we may want to know how the system op- 
erates. What are its input-output speciBcations, even though at first they 
constitute only a crude listing? Can these specifications be refined more, 
so that the system definition can be more specific? Will any tests tell us 
how the system works internally, so that we can specify operations for 
internal blocks and relationships? 


Does it work as predicted? 

In a later stage of refinement, we may have a system definition and 
want to know if one or more examples of the system definition in the 
real” world agree with predictions made from the system definition. 

Predictive tests are used in two senses: (I) The test of a system defini- 
tion for realism and logical correctness, i.e., the test of the definition “be- 
cause it may yet be imperfect; and (2) the test of a specific example to 
see if it conforms to the standards represented by a given systems defini- 
tion; i.e., the test of the example under the assumption that the definition 
itself is correct and fixed. 

We may have a case that should have been predicted by our system 
definition (a given input should have given a specified output because 
the case met all the conditions and assumptions of the system definition), 
but was not— here, the definition is faulty. Conversely, our manufacturing 
specifications and proved designs may provide a firm system definition, 
but we may have an instance that does not meet the specifications— here, 
the instance is faulty concerning the fixed definition. Generally, these two 
views of system prediction correspond to research and development test- 
ing and to production testing. 

Example: An engineer designs a radio to work on a 9-volt battery. He 
draws a blueprint stating how to make the radio and provides specifica- 
tions under which his design will work, (a) Suppose a prototype, made 
according to these instructions and supplied with a 9-volt battery, 
fails to work. We would suspect the engineers design, (b) Suppose 
the design has been perfected, and several “successful” units now work 
as predicted under the specified conditions. If a consumer uses the 
radio with a dead battery, he cannot expect it to work; nor could it 
work if plugged into a 115-volt house current plug. These conditions 
are contrary to specification. Similarly, the radio cannot work if it 
were not constructed exactly according to the design specification. 
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This outcome in no way invalidates the design, or system definition, 
but rather is a faulty example of it. (c) Finally, the radio may work 
when supplied with a 6-volt or a 12-volt battery, or if the girl who 
makes it accidently introduced a few -incorrect" parts. This means 
he operation or construction specification is not so critical as the sys- 
tem definition states: The definition may be broadened to cover the 
less restricted oases. 

In this way the system definition is improved by testing until we know 
what can and cannot do. The fi„ a , definition of a syl may be stated 
opera., onally terms of the specific input-ontpu, predietS £££ 

Why does It (ail to work as specified? 

a syriem ^ " ced ' e,ls ** *0 toll ns why 

two as specifically as possible ExartT 'll ? nd WlH give m th “ infonn:1 - 
is faulty? What specific addition ^ “ mp0nent in *he structure 
r ,d solve 

shooting are part of this process. ** *° r ^S 80 *** and trouble- 


Can It be Improved? 

* ™SnV,o^“;“ .He specifications set for 
achieve the specified result faster at lower r exain P Je * we may want to 
resources If the system does no! worT/eZ!- * ^ ieSS use of scarce 
sign, we want to repair or correct it H * CWi “ 8 *° specification or de- 
correct it. How can this be accomplished? 

What ore the effects o( t,ea, m e„, and change? 
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evaluate alternate proposals. We can regard such problems as analyses, 
rather than system definitions, because we presuppose a firm starting 
point from which to evaluate. 


OUTSIDE TESTS 

In distinguishing tests for the foregoing purposes, it is helpful to distin- 
pish between the extremes represented by outside tests, as opposed to 
inside tests. The boundary’ chosen for the system under consideration sets 
this distinction. 

Outside Tests versus Inside Tests 

We may want to apply outside tests, which concern only system outputs 
for selected inputs, either for convenience or from necessity. 

Outside tests are always necessary when we are unable to penetrate 
the system boundary, or when we are able to do so only with difficulty. 
Outside tests may also be more convenient than inside tests even when the 
latter can be applied. 

Example: A physical examination is certainly less painful, safer and 
easier, than, say, a bone-marrow test, or exploratory surgery. There* 
fore, on the human patient outside tests are usually performed before 
the more difficult and dangerous inside alternatives. Similarly, in test- 
ing a TV set, it is certainly possible to open the box and poke around 
with various instruments. However, it is far simpler to look at the pic- 
ture under various trial conditions, and to obtain a diagnosis by this 
means. 

The outside test, then, is usually a response test in which the output 
response of the system to selected inputs is checked against previous ex- 
perience or design specifications. Such response tests may be conducted 
rudimentarily by using one or two test cases, by applying a batter}’ of 
test oases, or by continuously changing input conditions over a specified 
range. 

Example: Several authorities have proposed a simple response test to 
detect hearing deficiencies in newborn infants. One seemingly crude 
but highly effective proposal is to subject an infant in the hospital 
nursery to a tone or a hiss produced electronically through a loud- 
speaker at controlled volume . 3 If the infant wakes up, or if the awake 

*M. P. Downs and G. M. Sterritt, “Identification Audiometry for Neonates: A Pre- 
liminary Report ” The Journal of Auditory Research, 4, 1964, pp. 69-80. 
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infant resconffs by movement, sound, ot other sign, he is clearly able 
to hear. If he does not respond, the infant may have some hearing 
difficulty, and is therefore carefully svatched. The simple r “P°" 
permits early detection of difficulty, which is important because vo 
tubular} stalls and verbal comprehension (dependent upon hearing 
ability) deielop most rapidly in the child's first two years . 1 


UVe the hcattng test, outside response tests offer a convenient screen- 
ing" method, which measutes an individual's response against known or 
specified standards or norms. The response test may also be used to col- 
ieet more information about a system s ability. 


Example Let us continue with the hearing example. The family pedia- 
trician may, in later months, subject the infant to the sound of a drum 
(low note) or a bell (high note), or a clap of the bands to determine 
the infant’s range of audible sounds. If necessary, when the child can 
speak and understand simple instructions, an ear specialist can ad- 
minister more complex tests with varying sound frequencies and vary- 
ing sound levels to plot the complete frequency response for each ear. 
Such additional testing more clearly defines what the child can and 
cannot hear 


Response tests are frequently preferred to inside testing procedures 
when the internal operations of the system ate not clearly understood, or 
when inside tests may alter the operation of the system itself. Thus, we 
may subject a complex system to a sene* of tests to aid our understand* 
mg of its operation, because an attempt to define it by other means may 
be beyond our ability. Yet, to know that the response of the system to 
input A is always ouput C, and to input B always ouput D~ regatdless of 
how these relationships are produced-telis us something useful about the 
system, and may be sufficient information . 4 Similarly, we prefer outside 
tests when we know that probes of the system are likely to alter its usual 


4 Ferfujlnts and Disorders in Communication, Heprint No. B35, The Alexander Graham 
Bell Association tor the Deaf, Inc , Washington, D C-, 1965. 

’The response of a linear system to a sinusoidal input is particularly interesting (see 
Appendix B ) , Because knowing tb» response, we can predict the response to any ticoe- 
vary mg input, If the input to a linear system is a sine wave, the output from that sys- 
tem wjH be a sine wave of the same frequency, but of possibly different amplitude, and 
poxsiWy displaced from the Input wave by a time t or, equivalency, by a phase 
angle «. If we can determine the complete amplitude and phase response of the linear 
system experimentally lot an input of constant amplitude ranging over ail possible 
frequencies, then «c can determine the equivalent transfer function for that linear 
system In mathematical terms (by appropriate curve fitting methods). From this p ouA, 
it is possible to evaluate and predict the response of other forms of input to the system 
dcttrfptten. For this reason engineering tests place major emphasis upon response texts 
and the development .of ^transfer functions for linear systems, either by experiment ot by 
mathematical consolidation of block diagrams or Bow graphs. 
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operation. In such “sensitive” systems selected outside tests may often 
provide the information wanted without disturbing the on-going opera' 
tions. Thus, in political and intelligence studies, the outputs of communica- 
tion services (radio, press, TV) and other publicly available indicators 
of system response to the input of current developments are monitored, 
so that knowledge of a foreign country may be obtained without the risk, 
danger, or disturbance introduced by the presence of internal probes or 
tests, such as agents, wire taps, or insurrections. 

Let us conclude this section with a short discussion of outside tests 
universally used in many fields: The tests for “response time,” input-ouput 
“balance,” and the “correctness” of inputs. 

An important fact about a system may be the time required to trans- 
late a given input into a given output. 

Example: The answer to this question is often obtained by “tagging” 
an input and waiting for the tagged output to appear. The customer 
may test the response time of his vendor by issuing an order for a 
given shipment and recording the delivery date of the shipment. The 
salesman may enter a motel and take a shower. He could test the re- 
sponse time of the hot water system by measuring the time between a 
turn of the “hot” faucet and the appearance of hot water. 

The response of a system to an input may be a single output, delayed 
by the response time of the system, or the output may he distributed over 
a length of time (a series of outputs will be initiated by a single input). 

Example: Two interesting examples involve the use of radioactive 
tracers in medicine. The first involves study of the human circulatory 
system, the second involves the operation of the human heart. 

To test circulation a small quantity of sodium chloride solution, 
tagged with the radioactive isotope Na 24 , is injected into a vein of the 
patients forearm. A Geiger-MuIIer counter, which detects radioactivity, 
is then placed in contact with one of the feet. If the blood circulation 
is normal, the presence of radioactivity is very soon detected in the 
foot; it increases rapidly and reaches a maximum value within an 
hour. If there is a circulatory impairment of some kind, however, the 
radioactivity will increase slowly, showing that the blood has difficulty 
reaching the foot. By moving the counter to different parts of the 
body, the position of the restriction can be located and the necessaiy 
treatment can be applied. 

To test heart pumping action radiosodium chloride can again be in- 
jected into the bloodstream. A counter attached to a recording pen is 
placed over the heart. As the radiosodium enters the right side of the 
heart the count rises, then drops as the venous blood enters the lungs; 
a few seconds later, the radiosodium appears with the arterial blood 
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in (he left side of the heart and there is another rise and fall of the 
count. By studying the resulting curves, the pumping action of the 
tuo sides of the heart can be compared and abnormalities can be 
discovered.* 

The response time of a system may also be distributed statistically 
about a mean value, rather than be a single feed value, or single feed 
sequence of outputs. 
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Similar outside response tests can be made for systems that transform 
information inputs, rather than physical inputs, although the same con- 
servation Jaws do not apply, because an input of information may be lost 
(and in the long run generally is). 

We may ask, for example, if an input message is transformed without 
loss of variety by a system, regardless of hoiv the message is transformed. 
Thus, if the input to a system is A, B, C, and the output is 1, 2, 3, there is 
no lass of variety because the transformation is “one-to-one.” On the other 
hand, if the input is A , B, C , and the output is 1, 1, 2, there is a loss of 
variety because the system codes both A and B into 1, and only C uniquely 
into 2. From such observations we may develop notions (and measures) of 
distortion, fidelity, and other response measures for the characteristics of 
information transformation systems. (The Second Law of Thermodynamics 
applies to information systems. See Appendix D.) 

Outside testing can also check that the assumptions of the system 
definition with regard to input conditions are met in practice. This is not 
a response test, but assures us that the response obtained is for the cor- 
rect input conditions. 

Example: The raw material inputs to a manufacturing process are 
usually routinely inspected to assure their conforming to specification. 
Such a test is common for the input supply voltage to electronic equip- 
ment, for the data used as input to a mathematical computation, and 
for inputs to human patients, such as the triple checks of prescription 
correctness by some pharmacies. Such checks of input conditions may 
often be conducted routinely by the system itself, as for example, in 
data-processing operations, where inappropriate inputs are rejected. 

If the system itself does not reject inappropriate inputs, however, the 
analyst should ascertain that the inputs to the system under study meet 
his assumptions or specifications. 

The Danger in Using Outside Tests 

Unfortunately, although inside tests are more likely to cause unwanted 
system change than outside tests, outside tests can have the same effect 
if the system has some memory ability. The classic example of this danger 
is the land mine, discussed on page 32. The investigator who applied 
outside pokes to the mine might conclude it was harmless, only to have 
his tests themselves move the internal position of a firing mechanism. In 
the same way, a series of inputs to 3 system can modify the system itself 
by altering its memory content, its interna] connections and conditions of 
response, or its goal-changing functions. 

Such internal changes may be harmful or beneficial to the system (or 
the analyst), depending upon the kind of change generated by the tests 



446 Systems Analysis. A Survey 

m i the purpose of the analyst. Tot «ampK testtng itself may imptove 
the system's opmum: Teachers hope them inBnicbon and tests wUl 

ben Out l ^m“ d oT«uise, is that the application of outside tests without 
any knowledge of internal structure and function may produce surprising 
results, and outside tests must therefore be used with caution, even 
though they are die only, or the easiest way to proceed. We do not nor- 
malty “see what will happen when we push the A-hutton” unless we know 
something about the internal workings of the system under test. 


Systems Analysts 

Tests or Questions 


OUTSIDE 

Response Tests 
Frequency 
Overload 
Stability 

Fidelity to specificauons 

Henson 
Prediction 
Limits of operation 


inside 

Structured Tests 
Omissions 
Compatibility 
Coupling effects 
Alternates 

Reason 

Check definition 
Understand structure 
Troubleshooting 
Improvement 


9-2 A brief outline on systems analysis. The outside tests treat the system ItseU 
as a black box. The inside tests probe within the system mainly to ask questions 
about structures, malfunction, and improvement. (Taken from D. Hertz and R. Eddison, 
Eds . Progress In Operations Research. Wiley, New York, 1964, Vol. II, p. 136.) 


INSIDE TESTS 

When it is possible to probe the system so that we can make internal 
specifications and test them, a wide range of point-to-point checks be- 
come available to us. Indeed, some inside tests may be easier to make 
than outside tests. 


In many systems internal inspection and test are relatively easy, and 

TT ,haI ,nitfucts at hi* t*n!. a* one anonymous Arab peda- 
5T™? 1 *- j •***/■"» "?** P^7, and you defile me w«h your v er *r, I make 

you an archer, and you shy me with your bow." 
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probing will not in any way affect the internal structure or condition of 
the system. 

For example, the continuity and correctness of flow or connection be- 
tween system elements may often be observed or tested without damage 
to the elements. 

Example: Suppose we know the system flows should occur in a cer- 
tain sequence or between specified elements and not between others. 
Such specification may be summarized in a “from-to” matrix, as previ- 
ously discussed on page 97. Each possible from-to connection speci- 
fied may then be checked against the flows or connections observed. 
Flows that should be present and are not and, conversely, flows that 
are present and should not be can be checked in detail. Automation 
of such “continuity’’ and "short" tests in checkout of electronic equip- 
ment, such as the point-to-point check of IBM circuit modules by 
computer, is current practice. 8 Detailed checks of paperwork and ma- 
terial flows are a similar example. 

Moreover, it may be possible in some systems to get an over-all view 
of the internal workings of the system from the “outside” without disrup- 
tion of the system. For example, after removing the protective cover from 
a piece of machinery, visual inspection may reveal components that are 
melted, discolored, worn, disfigured, or damaged, due to extremes of en- 
vironmental conditions, wear, breakdown, or abuse. Some “outside" measur- 
ing techniques, such as X-rays and the techniques of radiography and 
thermography, allow internal inspection of system construction or opera- 
tion without damage to the system. When such forms of inside inspection 
are innocuous they may allow the analyst to proceed with extensive inside 
testing at will. In the same way, nondestructive readings of documents, 
computer memory content, dated parts, and similar obvious internal checks 
can often be made without altering or modifying the system operation. 

Frequently, although it is impossible to view the entire internal work- 
ings of the system without difficulty, “windows” in the system may allow 
partial inspection. 

Example : The human eye, for example, is such a window, allowing the 
physician to “see” a selected internal portion of the body without violat- 
ing its boundary. The physician can look in while you look out. The 
physician uses this ability to check the effects of elevated blood pressure 
(an outside measurement) upon the internal capillaries by direct ob- 
servation into the depths of the eyeball. Many famous discoveries have 


* See the quotation given as case m. Problem 9.1. 
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been made when such a “window" has been found in an otherwise 
blackbox.* 


Selection of Probe Poinls and Probe Types 


When inside testing becomes difficult or dangerous, or when inside tests 
may disrupt the system, we must carefully select the probe points and probe 
types. In addition, because the possible probe points and probe types in 
testing a complex system usually exceed the capacity of the analyst or his 
need to employ them, and because tests consume effort and time, the 
strategy of test selection becomes an important issue for inside testing- 
The same argument holds, of course, in the selection of outside tests to 
present to the system, but the increased difficulty, cost, and danger of inside 
testing heightens the problem. 

The effects of internal probing for information, usually referred to as the 
“probe effect," provides one guide to the choice of probe points and probe 
types. In most inside system tests (and m some outside tests) the presence 
of the measuring or testing medium (whether it be an instrument or a 
human observer ) changes to some extent the character of the system under 
analysis. The question is “What will be the effect for different probe points, 
and for different types of probes, or methods of questioning?" 


Example. To take an obvious example first, suppose we propose to pur- 
chase a small business at the request of its proprietor, who tells us the 
firm’s assets (cash, inventory, fixtures, etc.) have a total value of $X, 
broken down into the various accounts. Being reluctant to accept this 
statement at face value, we insist the proprietor employ a Certified 
Public Accountant to audit the books of the firm, assess the value of 
the inventory, and so on This auditor charges $Y for his services, a sum 
paid from the current assets of the firm. If SY is a large percentage of 
the assets $X, the act of auditing will have substantially changed the 
firm s asset condition-the measurement and test will have altered the 
system-But, if SY is a small percentage of $X, the probe effect will be 
; “<*>’ ln hardware systems is xvhen a measuring instrument 
overloads the system s capability, thereby producing erroneous results. 

ana J >S “ ° f SOCial and human s >' stems - { he probe effect 
increased. Thus, crude interviewing techniques or the known presence of 


' srszjs Sst-jSStsst- r* - “t rts 

permanently opentd by a gunshot wound. “n,e W \ ose had been 

l-miblc an oaploratlon of ; in. oil chan ”' noennenre made 

Acomstanoo, ln .JmtaUy ■» tMerent 

p-tfomic<l capodmonti ofiich made libtory incidence" W b”cS?. f>«.umont 

■ Incestlsator, Norton, New y 0 iJf. p »9. Cannon - Tft* Way of on 
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an ‘observer” In a group meeting usually alters drastically the pattern of 
social interaction, and the analyst may unwittingly or purposefully be de- 
ceived . 10 Field or clinical observations by psychologists, psychiatrists, so- 
ciologists, anthropologists, and political scientists are subject to this probe 
effect, and some writers have observed that little should be expected of 
quantitative analysis in these areas as a result . 1 1 

Consequently, the analyst must select a probe point and type that 
will not produce a distorting effect. At the same time, he hopes to find a 
probe point and type that will reveal to him, for his time and effort, the 
most information concerning the test system. Although a balance between 
these /actors is easier to call for than to obtain in practice, the analyst can 
frequently select a “critical” test point and test type to exploit, even in 
very complex systems. 

Example: In 1954, the CIA received a report pinpointing a terminus of 
telephone wires serving East German military and civilian officials— a 
conglomeration of wires that would handle simultaneously 432 calls. 
This “find,” if it were true, provided an excellent probe point into the 
East German Communication system: It would produce voluminous 
results if it could be tapped. 

Further investigation by mapping and surreptitious inquiry revealed 
that the connection point was five feet underground below the village 
of Alt-Glienicke on the Soviet side, yet only 600 yards from the American 
Zone border village of Rudow. With this information available, a “cover" 
operation was put into effect. An Air Force radar station was constructed 
at Rudow, while a tunnel to the connection point was dug. Dirt and 
debris removed from the tunnel were carted away in small containers 
disguised as various Air Force equipment. When the tunnel was com- 
pleted (in record time and under conditions of utmost secrecy), the 
lead-covered telephone cable was tapped, and the 432 wires were con- 
nected to 432 tape recorders. For a year complete interception of con- 
versations was made until the tap’s discovery on April 22, 1956. 

“ Many anthropologists have been embarrassed to discover after years of study that the 
“Old Chief” of the tribe had been spinning yarns, rather than relating facts, In the 
belief that his friend the anthropologist wanted to hear something interesting. 
n "In other words, in the social sciences we have to deal with short statistical runs, nor 
can we be sure that a considerable part of what we observe is not an artifact of our 
own creation. An investigation of the stock market is likely to upset the stock market. 

We are too much in tune with the objects of investigation to be good probes. In 
short, whether our investigations in the social sciences are statistical or dynamic— and 
they should participate in the nature of both— they can never be good to more than a 
very lew decimal places and, in short, can never furnish us with a quantity of verifiable 
significant information which begins to compare with that which we have learned to 
expect in the natural sciences. We cannot afford to neglect them, neither should We 
build exaggerated expectations of their possibilities. There is much which we must 
leave, whether we like it or not, to the un-“sdentific,“ narrative method of the profes- 
sional historian.” N. Wiener, Cybernetics, 2nd ed , Wiley, New York, 1961, p. 164. 
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The discovery of the probe clearly altered the operation of the ob- 
served system, tat previous to its discovery the delicacy with which t 
probe had been inserted and the critical nature of the probe point pro- 
vided a maior intelligence triumph . 11 

An interesting feature of this wiretap example is the number of 
measures taken to disguise and alleviate the effects of probing into the East 
German Communication System We are not told the technical details rt 
the electronic skill required by the successful wiretap, tat the expensive 
and laborious efforts to disguise the tunnel itself are evident. 

Thus, in the insertion of probes for sensitive inside tests it may be pos- 
sible to devise counteracting treatments for the system to disguise the probe 
or the probe effect. For example, preparations that will allow a probe to 
he made in such a way that the subject will not be perturbed, embarrassed, 
scared, upset, disheveled, hurt, or injured are essential to the work of 
analysts in individual and group problems. The office environment of the 
physician or legal counsellor is designed to insure privacy of interview, 
as are the laws of disclosure. The interviewer gauges his attitude to inspire 
confidence in the respondent or to place him off guard. The surgeon em- 
ploys anesthetics to mitigate the pain and apprehension caused by an opera- 
tion. The psychologist observes a group of children at play through a one- 
way mirror. Each of these artifices represents an attempt to reduce the 
probe effect 1 * 

Similarly, the ability to transform variables or to correlate measurements 
aids in reducing the probe effect 

One of the more difficult higher order inside investigations (and from 
what we have seen in previous chapters, one of the most valuable) is the 
determination of long-range system goals and system values. In fairly simple 
systems, an inspection of system structure, memory content, and possibly 
goal formulation procedures may reveal the system's needs, but in more 
complex systems such direct probing is not likely to produce the desired 
results. In such systems the device is often to change the probe type to one 
which is not alarming, yet which has results highly correlated with the de- 
sired, but offensive question. 

Example: 'Using the 'indirect’ procedure, questions are asked, for ex- 
ample, about how radical or conservative certain magazines or men in 

“A. TuCr, CM- The ZiuWeStorv, Momrw, New York, 1962, pp. 2-6 According to the 
.other, the Sonets opened the Rpdow tonrrel . toon* attraction until Tone 9, 1958, 
when an annonneenreot was made tot '-10,000 Worker Dele®!., f,„„ £„« C imuay 
and the Soviet Union have seen this taorriph of American espionage" 

For some interesting interviewing approaches, see I. F. Marco,so„, AdoefUona In 
Irilercleurnc, John Lane (Bodley Head), London, 1920. Other Interesting examples 
M ‘ ° eu!! , d ’- *"d S- W. Cook, Editors, Re,Lfh Methods 
in Social Relations, Fart Ttco: Selected Techniques, Dryden, New York, 1951. 
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public life are. The respondent who calls them all conservative indirectly 
reveals his own ‘radicalism’ and conversely with one who rates them 
as radical.” u 

Scientific measurements that would otherwise be destructive or difficult 
may also be conducted by measuring a variable that can be transformed 
mathematically or physically into the desired quantity. For example, the 
measurement of the moisture content of a sample of raw material may be 
difficult or impossible. Rather than attempt the direct measurement, some 
moisture-measuring instruments measure the amount of radio-frequency 
(microwave) energy absorbed by the sample, a quantity that can be re- 
lated mathematically to moisture content. The latter measurement is easy 
and nondestructive. The field of scientific instrumentation abounds with 
similar examples, and the ability to make such “mathematical” transforma- 
tions allows measurements to be taken at a distance and "telemetered” 
back to the observer, who may be miles from the measuring point. 


PARTIAL INFORMATION 

In most systems we encounter, we are not driven to the extremes of outside 
tests alone, or inside tests alone. We have at least partial information con- 
cerning the internal operations of the system, so that we may select outside 
tests more intelligently, and we may have at least a few easily made inside 
tests. 

For example, if we recognize a system to be a member of a larger class 
of systems, we know something about it although not perhaps the complete 
details of its operation. If we have a history of the particular system at hand 
We know something about its present condition, although perhaps not 
everything. 

Such partial information greatly narrows the range of possible tests 
that may be needed and permits us to make a good guess (or a logical or 
probabilistic choice as in Chapter 11 ) of the next test procedure. 

Moreover, if we can make some inside tests in addition to outside tests, 
based upon our partial knowledge of internal system structure, our variety 
of possible testing procedures is greatly increased. 

For example, with the combination of outside tests and a few internal 
probes we may check the response of selected parts of the system, rather 
than its over-all response and thereby isolate the system part that may be 
improved or corrected to improve or correct over-ail system response. 

Similarly, it may be possible to inject an input internally, and then by 

“A. Kornhauser, “Constructing Questionnaires and Interview Schedules," M. Jahoda, 
M, Deutsch, and S. W. Cook, Research Methods in Social Relations, Dryden, New 
York, 1951, Vol. 2, p. 452. 



outode testing to define internal system structure more clearly. (Combined 
strategies of this type are discussed in Chapter 10. ) 

Example. The physician may inject dye into the bloodstream so that 
specified portions of the circulatory system will appear on an X*jay 
picture. Although the physician penetrates the body to make the injec- 
tion, such penetration is seldom dangerous or particularly harmful to 
the patient. Angiograms and arfertogronw, which reveal clearly the de- 
tads of selected circulatory functions, are made routinely by this method 
in the analysts of human circulatory problems. In one medical refinement 
of this technique it is possible to mo\e the X-ray camera during exposure 
so that only one plane through the patient’s body is “in focus" and, 
further, by repeating the process with diSerent camera motions, to take 
photographic slices through the patient’s organs without injury or dis- 
comfort to him. Similarly, the “signal tracing" of tagged documents and 
similar system inputs to an organization may be detected at selected 
internal test points. By selecting the inputs carefully, specific paperwork 
functions may be tested. The latter approach may also be used in the 
diagnosis of hardware systems. (Techniques of this type are discussed 
more fully in Chapter 10.) 

With the objective of combining outside and inside tests, a physical sys- 
tem may be designed so that selected inside measurements can be extracted 
at “test points" or output terminals from the equipment. Thus, certain moni- 
toring points are built into a chemical processing facility with readings 
displayed on a control panel. Check answers in a computer program are 
often printed out during the progress of the program’s execution. To reduce 
the cost of maintenance, trouble-shooting and repair, many electronic sys- 
tems are constructed so that inside tests can be conducted from outside 
terminals, inspection points, and metets “inserted" for this purpose. Al- 
though this design approach increases the initial cost of the system’s design 
and construction, such action can drastically decrease maintenance and 
trouble-shooting difficulty, and is usually necessary for detailed “on-line" 
control of a complex system. 

Tims, the analyst usually h as partial information about a system— either 
from experience, as the result of specific test procedures, or by design- 
ami his ability to use this knowledge greatly increases his diagnostic and 
control ability. Wiat, w c might ask. would happen if the analyst had “com- 
plete information concerning the system under study? 


complete INSIDE INFORMATION ABOUT STRUCTURE AND FUNCTION 

If »e arc totally WvMgrabfc about the inner « 0lWl)ss o! a „, lOT (a! „ 
gnen let el of detail), esen greater flexibility in testing is available to us. 
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First, if we could conduct independent and nondisruptive internal tests, 
we could check the transformations at each stage or component of the 
system. More detailed isolation of faults would be possible, and several 
relatively simple tests, such as the correctness of subsystem output on a 
point-to-point basis for a given system input, might be possible. A detailed 
examination of internal memory content, or system status, at a given mo- 
ment or step in a process is another example of internal tests, when we have 
complete knowledge of the system. 15 

Conversely, with complete knowledge of what the system should do 
internally, which may, for example, be known from design criteria, it may 
be possible to devise a series of outside response tests that will sequentially 
check selected system functions in a “diagnostic routine” of test problems. 

The logic of such diagnostic routines will be explored in Chapter 11. 
However, in simple terms, as shown in Figure 9-3, if we can find inde- 
pendent blocks in the system, and if we can find input tests that will re- 
quire the correction operation of one combination of systems blocks at one 
time and another combination at another time, we may be able to check 
correct block-by-block operation by logical selection of the block combina- 
tions to be tested. In computer design, for example, system blocks are often 
constructed so that diagnostic routines may be developed for them. Even if 
a complete diagnostic checkout is impossible or unwanted it is still possible 
to develop specific diagnostic routines for differential diagnosis of a 
selected set of symptoms. 

Examples of this approach— based on outside tests and a knowledge of 
system structure and block transformations— can be found in many areas. 
For example, an accounting system may be audited electronically by a “test 
deck” of trial conditions entered into the electronic computer. This diagnos- 
tic routine checks the accounting system, its files, and the computer function. 

Example: Suppose we wish to test an accounting system, which, con- 
verted to an electronic computer, computes a payroll for a large number 
of employees. If we define when we should pay each employee, and 
thus define all other conditions as “illegal,” we can compose a sample 
list of employees, state the conditions under which they worked, and 
feed this list to the system for evaluation. Some tests might include the 

“The "failure” of the airborne computer in the first Gemini V space launch may not 
have been a failure of the computer at all according to NASA reports Although the 
computer was working satisfactorily, something had altered the contents of the com- 
puter’s nondestructive readout core memory unit. Although no reasons for this change 
have been advanced, the ability to isolate the change was based on a complete knowl- 
edge of what should have been in memory at the time of test. See J. Mercer, "Gemini 
Malfunction, Was It the Computer?” Control Engineering, July, 1965, p. 24. (Later 
investigation showed astronaut switching procedures had placed the computer on 
battery power for too long a period, depleting the battery and causing the unexpected 
memory change. Control Engineering, August, 1965, p. 32.) 
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9-3 The principle ot a diagnostic response test. Selected Input A requires the 
correct operation of the sequence of elements shaded in the upper diagram. Selected 
Input S requires the correct operation of the sequence of elements shaded In the 
lower diagram If input A gives the correct output result, put input B does not, there 
is a high probability that element X 13 defective Progressive tests of this type can re- 
duce the probability that elements tn the common shaded chain are causing the fault. 
(Taken from 0. Hertz and R Eddison, Eds., Progress in Operations Research, Wiley, 
New York. 1964. Vol II, p 138) 


names of employees not on the payroll, employees who were on the pay- 
roll, but just quit, employees assigned to the wrong department or with 
erroneous wage rates, employees who supposedly worked an “impossible” 
number of hours during a week, and so on. Should any of these people be 
paid ( on paper, of course ) a fault in the system will be indicated and 
probably isolated. The same procedure can also be applied to other 
accounting and record-keeping functions, such as accounts payable, ac- 
counts receivable, cash disbursements and receipts, inventory, and 
capital accounts. 1 * 


In the remainder of this book we will discuss in more detail several 
testing, trouble-shooting, and diagnostic procedures. We will see that test 
selection is, by analogy, the same as the initial selection confronting the 
analyst when he first defines his system, and that the methods of systems 
simplification discussed in Chapter 7 and 8 are also applicable here. Once 
the system has been specified, however, and its definition “frozen” or tem- 
porarily agreed upon, that framework guides our further analysis. We will 
now turn to methods of system simplification in analysis. 


“Dept. of U.S. Air Force, Guide for AudiUng Automatic Data 
eminent Printing Office. Washington, D.C., 1961. 


Procesting Systems, Gov- 
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PROBLEMS 

9.1 Several response tests made from the outside of a system were cited in the 
text. This is the most widely used form of testing, one we use in everyday 
life when we converse with others, turn on an electric light, or start a car. 
It is, however, important to understand when this form of testing is likely 
to be most successful and when it must be used with care. 

To illustrate this point, here are three cases in which outside response 
tests are proposed. What are the advantages and dangers of outside testing 
in each case? Where is the outside approach necessary? When is the useful- 
ness of the approach greatest? In which case would your confidence in the 
results obtained be the greatest? Why? 

case i: From a book by Theodor Reik (Adapted from Listening with the 
Third Ear, by Theodor Reik, by permission of Farrar, Straus & Giroux, Inc. 
Copyright 1948 by Farrar, Straus & Giroux, Inc. and The Inner Experience 
of a Psychologist, by Theodor Reik, by permission of George Alien and 
Unwin Ltd. Copyright 1949 by George Allen and Unwin Ltd.), p. 23, we 
have the following quotation: “. . . [the hunch of psychoanalysis] is that 
men reveal themselves— all their emotional secrets— when they talk freely 
about themselves; not just when they talk about their secrets, but about 
everything concerning themselves.” Later, on p. 126, we find a similar, al- 
though apparently contradictory, quotation. “Mahler once remarked, The 
most important thing in music is not the score.’ In psychoanalysis, too, what 
is spoken is not the most important thing. It appears to us more important 
to recognize what speech reveals and what silence reveals.” (The outside 
test here is the analyst’s remark, or silence, and the response is the patient's 
verbal output, or silence.) 

case n : In an attempt to automate the auditing of financial and account- 
ing Systems implemented by electronic computer, many industrial firms 
and government agencies (e.g.. Department of the U.S. Air Force, Guide 
for Auditing Automatic Data Processing Systems, Washington, D.C., 
Government Printing Office, 1961) have proposed the development of a 
series of “test decks” of cases for input to an accounting, or other form 
of data processing system. (Indeed, the use of test decks is a standard 
procedure for testing newly designed programs, and the experienced pro- 
gramer or systems analyst spends many hours devising the most difficult 
and extreme cases his system can encounter to test its operation under these 
unusual, as well as usual, conditions.) The purpose of the test deck is to 
determine if these diverse inputs are responded to in the appropriate, con- 
ditional manner by the system. Presumably, if the range of response tests 
is great enough we may assume the system is free from error, as well as 
from purposeful fraud. (The test deck approach is currently a highly con- 
troversial auditing method for accounting systems, but it is gaining accep- 
tance in many areas. Why this diverse opinion?) 

case m: From an IBM press release dated October 12, 1965, we have 
the following quote: “Endicott, N.Y.— Microcircuit assemblies for IBM 



255 Systems Analysts: A Survey 

System /3 60 travel through a computer-controlled final test line The three- 
station machine, controlled by a specially modified IBM 1410 data process- 
ing system, can perform more than 1000 short-open tests, scores of im- 
pedance tests, and hundreds of logical integrity tests on every assembled 
card The tests, at an average rate of one card a second, represent the 
last step in the production at Endicott of the printed circuit cards used in 
System/360.” (Similar approaches are used in final missile check-out, and 
are proposed for automatne testing m the field See Problem 9.3. Although 
special test probes are often brought out for response testing purposes, for 
the purposes of this question, consider this application an outside test.) 

9 2 Consider the following example of a diagnostic routine. The Allen Electric 
and Equipment Company manufacturers a small computer (in the $7500- 
S 10,000 price range) that hooks up electrically to an automobile with 
eight leads in five minutes The machine performs up to 80 diagnostic tests 
on the cars ignition and carburetion systems in ten minutes or less. The 
sequence of these tests and the standard specifications of the manufacturer 
of the car are set by an IBM-type card inserted into the computer. (See 
cut of a card for 1966 Chevrolet, below ) 
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Test results showing “good” or “bad” and test notes are printed on an- 
other card, which becomes a record for the customer and the shop. (See 
second cut, below.) The computer tests can be expanded to include other 
auto systems, such as cooling and exhaust, as these test functions become 
economically feasible. The test unit was inspired by units used to check 
the jet propulsion systems of missiles at Cape Kennedy. (This machine 
is described in more detail in “Computerized Engine Analyzer Could 
Revolutionize Car Care,” Motor Age , February, 1966, p. 101.) 

A. Under what conditions could such a diagnostic routine of tests be 
applied to other systems? 

B. What are some of the advantages of such an approach to auto maintenance 
for the car owner? For the shop owner? 

C. Could such a testing approach be used if the specifications of auto 
performance under different conditions were not completely known? 

D. What are some of the dangers, if any, in the use of such equipment? 


XAWf - MAKE. 

ADDRESS _YEAR_ 


j PATf cr TEST No . 


tooo nine nit 


150 RETRACTABLE HEAOLIGHT OPERATION 

MARG 

149 HEADLIGHT CANOLEPOWER L H HIGH BEAM 

MARG 

148 HEADLIGHT ALIGNMENT 1 H HIGH BEAM 

cffffir 

147 HEADLIGHT CANOLEPOWER L H LOW BEAM 

FAIL 

146 HEADLIGHT ALIGNMENT E H LOW BEAM 

GOOD 

145 HEADLIGHT CANOLEPOWER R H HIGH BEAM 

FAIL 

144 HEADLIGHT ALIGN VtNT R H HIGH BEAM 

COOB 

143 HEADLIGHT CANDLEPOWER R H LOW BEAM 

COOO 

147 HEADLIGHT ALIGNMENT R H LOW BEAM 

MARG 

241 WHEEL ALIGNMENT - CAMBER - 1 * 

MARC 

140 WHEEL ALIGNMENT - CAMBER - R R 

FAIL 

139 WHEEL ALIGNMENT - TOE IN REAR 

;ooo 

138 STEERING WHEEL SPOKE POSITIONING 


137 WHEEL ALIGNMENT — l CASTER 

:ood 

136 WHEEL AllGHMEHT - K CASK!! 

FAIL 

135 WHEEL ALIGNMENT - L CAMBER 

FAIL 

134 WHEEL ALIGNMENT - R CAMBER 

GOOD 

133 WHEEL ALIGNMENT -TOE IN 

MARG 

132 MAXIMUM ROAD HORSE POWER 


iji automatic 7AAVS sum pattern t minion 

MARG 

130 CLUTCH PEDAL TREE TRAVEL t OPERATION 


129 OBSERVED REAR WHEEL BALANCE 

MARC 

128 Rear Wtieel Br vs Speedometer Test It 50 MPH 

Good “ 

127 Rejr Wheel O'ne Speedometer Test it 30 M P H 

GOOD 

176 CARBURETOR PCtvfR VALVE CKRtHOH 


125 INTAKE MANIEOID VACUUM -ICAO 

GOOD 
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9.3 In several types of systems situations, the analyst prophesies either at Hs 
peril or to his advantage For example, a self-defeating prophecy is one in 
which the analyst's predictions are defeated by a reaction of the system to 
the analyst’s statements. If an analyst predicts that the prices for hogs will 
be very low next year, farmers who hear this prediction may well constrict 
the production of hogs, thereby raising the price of pork. Thus, the analyst's 
prophecy vs mode erroneous, because he made it public. Conversely, sup- 
pose an analyst predicts that the sales of automobiles watt be greater next 
year than the previous year. \f this prediction is incorporated into the sales 
quotas of the distribution organization, the prophecy may well come true, 
because the salesmen will work harder to achieve the goals set for them. 
The prophecy becomes self-fulfilling. 

A Can you think of any other examples of self-defeating and self-fulfilling 
prophecies? 

B Realizing that effects of the type described do occur, bow could you use 
prophecy to manipulate a system? 

C How could the investigation of a system act as a prophecy, as viewed 
by the system? 

D. For what lands of system would an optimistic or a pessimistic prophecy 
be irrelevant? 
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I 


One of the most general problems encountered in systems analysis is that 
of search. If, for example, the system does not work or if it fails to work 
according to specification, we must search for the fault or cause of mal- 
function. In general, there may be multiple faults or causes of malfunction, 
and this considerably complicates the problem. We may have the problem of 
locating a given item in a file. If the desired item (or items) is not uniquely 
described, as is the case in many information retrieval problems, the 
search problem is complicated. A similar search problem occurs when we 
attempt to find a designated target, such as a mineral deposit hidden in 
the earth or a life raft tossed about at sea, for which only partial coordinate 
information (or no information) is available. Yet another example of the 
search problem is that of determining optimal policies or conditions for 
maximizing or minimizing a given objective function. 

Thus, in general, the problem of search is another way of describing a 
selection process in which we attempt to pick out or isolate in a system a 
given point or set of points with specified characteristics (e.g., a faulty 
component, a desired document, a physical location, an optimum adjust- 
ment). Much of what has been said about system simplification is relevant 
to this problem, and it is worthwhile to pursue the subject further, because 
it is also central to the problem of systems analysis. 

In what follows, we will survey a group of typical search problems that 
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ooio( uo several generally useful, search strategies. Although each field 
has its specific techniques, our illustrations are representative of the thinkmg 
that cun be applied in a broad class of problems. Because of their specific 
interest, formal techniques of logical and probabilistic diagnosis have bee 
left for Chapter 1L We now take up, in order, trouble shooting, information 
retrieval, random search, and optimizing search. 


TROUBLE- SHOOTING METHODS 

Problems of maintenance, repair, and fault correction in systems require 
that a given difficulty (or difficulties) be found and corrected. To speed 
this process we employ various methods, each of which attempts to isolate a 
defect in an orderly, progressive manner. Although the following examples 
are taken from current maintenance and testing practices in the hardware 
field, they have general applicability. 

Defection at a Glance 

Observation, in addition to revealing various symptoms of malfunction 
(which may be used as described in Chapter 11 to isolate faults logically), 
may also reveal the solution. If, for example, physical damage (such as a 
melted component, a severed electrical connection, or a hydraulic leak) is 
present when the investigator looks at a hardware system, it often provides 
a direct clue to the system’s failure. Because this form of inspection is often 
the easiest, it is usually the first trouble-shooting method employed. 

Similar forms of diagnosis are often possible when the system has been 
provided with measuring instruments, such as meters, trouble-lights, or 
other automatic detection equipment. For this reason many electrical and 
mechanical devices have metered outputs. The automobile has a gas gauge; 
the radio transmitter has one or more current and voltage meters; the 
hydraulic system has metered pressure and temperature points. 

We often employ senses other than sight in such direct diagnostic steps. 
The sound of the motor tells the mechanic much about its internal operation. 
The electrician looks for overheated components by touch or by smelling 
the acrid charred shellac of electrical insulation. The physician uses even 
more complex forms of physical examination, before he makes other tests. 
He, lor example, observes the patients pallor, the slight shuffle in his step, 
the sound of his voice, the perspiration on his brow, and uses these direct 
observations as his first diagnostic signals. 

We emphasize these relatively obvious observation methods not only 
because they aie the easiest to employ, but also because they are frequently 
the source of effective screening decisions that greatly affect the later 
course of diagnosis. Because major decisions are frequently made at the 
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outset of a diagnosis, ' such decisions should be made to discriminate as 
grossly as possible. And if the initial physical observation is made well, 
the investigator is often led to the solution of his problem at once, or with 
very few confirming tests. 


Short and Continuity Checks 

In hardware systems of any complexity, the system structure may not be 
as desired, either due to faulty construction, or due to deterioration, dam- 
age, or unplanned changes in configuration which occur with time. In such 
cases it may be relatively easy to make point-by-point checks of desired 
structure against actual structure, and to perform such tests in a methodical 
way. 

Example: Suppose a system consists of N components that may be con- 
nected in various specified ways. We can summarize such a system of 
interconnections in matrix form, as shown in Figure 10-1, and test the 
presence or absence of each matrix entry by observation or measurement. 
Two faults may occur in such an interconnected system. First, a re- 
quired connection may be absent, so that the required flow continuity 
is not possible. Such a defect would show up if a required cell in the 
specification matrix did not also appear as an observed combination. 
Second, a required flow between two given points may, in fact, be 
observed to connect two other points. In electrical trouble-shooting, 
special misconnections of this type called “short circuits” provide a name 
for the frequent observation of a long flow path that is shortened in- 
correctly during manufacture, by component or insulation failure, or 



ID-1 Short and continuity testing using matrix specification. The figure shows a four- 
component system which allows connections between each component. When no 
distinction is made between the direction of flow, and when there is only one connec- 
tion between any two blocks, the number of connections is {N 1 - N)/ 2, in general, and 
six here, as shown. In the matrix, the cells to be tested are above the main diagonal. 
Extensions of the method can show multiple ceil entries, or make two-way flow 
distinctions. 
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by dama 6 =. I» many cases, tests of this type can be automated, because 
they »se simple, numerous, and repetitive. 

The mat™ approach to orderly testing may be extended easily to 
M oos ay aosvs, or to multiple connections of different types, either by usurg 
multiple mterconnection matrices, or by delineating the few 
within each matrix cell. Thus, the components may be connected electrically 
and hydrauhcallv, or there may be multiple electrical or hydraulic con- 
nections between each system block. The testing procedure is not altered 
by this additional complexity, although the number of tests to be made 
obviously increases. . 

Similar tests for system structure are also possible for paperwork, m* 
formation flew, and logical systems. 


Signal Tracing and Signal Substitution 

Another large class of test procedures examines the correct transformation 
of a systems blocks in response to a standard test input. This procedure is 
known as signal tracing, and is illustrated in Figure 10-2. For illustration, 
consider a system composed of four blocks connected in series to provide 
a succession of input transformations. A standard test input is applied at 
the input to Block A. It is assumed that the correct transformation is known 
at each step of the process, \.e., that the variables that should be observed 
at the outputs of B, C, and D are known. Assume we also have available a 
probe that will not distort the measured signal at each point, so that the 
observation itself will not affect the quantity measured by overloading, 
or by other forms of the “probe effect.” Then, by progressive comparison of 
the specified output versus the observed output at each system block, as 
we mo%e away from the input, we can always isolate a defective block 
if the blocks arc independent in their operation. 

Example. Four light bulbs are connected in series, so that all must work 
ff the string is to light. The string does not light, and the defective bulb 
(or bulbs) is sought. To do this, the string is plugged in to provide a 
test voltage. We make successive probes continuing from the first bulb 
along the string until the first defective bulb is found by the absence 
of the test voltage. When the defective bulb is found, it is replaced. 
The string then either does or does not light. If it lights, the testing 
stops. If not, we continue the same procedure until the it defective 
bulb has been replaced and the string lights. Although this straight- 
forward signal-tracing procedure can be made more efficient (as we 
shall see), the Brst-to-Ust approach is often used because little thought 
is required in the selection of test points. 
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10-2 Isolation methods using signal tracing and signal substitution The da^ arrows 
indicate the application ot a test signal. The «h,« arrows indicate the appl oahon of a 
monitor or signal detector, (a) Signal tracing with successive prabe points. \ a gna 
applied to the system Input Is traced for correctness away tram the Inpul (b> 8l 0 nrt 
substitution A monitor is applied to the system output, end the correct signal is in- 
serted in pragressive stops away tram the output, (c) Use of slgnal .ubsti Mlon and 
tracer probe to test a single system block tor correctness ot ^ ^ 

tracinn with binarv selection of probe points Either probe point 2 or 2 will be used 
£> pro b, ,. The same _ procedure i ear , be use 
With signal substitute. Binary probe point select, on results In fewer probes to, 
isolation. 

Clearly, the procedure just described can be reversed by the use of 
signal substitution, as illustrated in Figure tO-alr ln thui approach a est 
probe is applied at the output of the system. The T 

block that will produce the standard output is assumed known. Then, the 

reared input a ? t each block is ££»**?* So” S 

“s dSe“. Again, the method will always 

if the blocks are independent, and if the test inputs do not cause probe 

efc Al S a final example the combination of signal substitution and signal 
As a hnal ex p performance of selected system blocks, as 

tracing may be n-d to cst the p rt L , tested by appropriate 
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m assume that this insertion and ptnbe combination can t be i made Without 
aitering the system operation ot the signal and its detection. 


Creating independence 

in systems in which the Mocta do not operate independently, which fre- 
quently happens when leedbach loops are piesent, it is sometimes passible 
to create forced independence by disconnecting the feedback loops and per- 
forming tests specified for that condition. In the extreme, we could possibly 
remove the component blocks from the system and test each block separately, 
according to separate test specifications. In many cases, particularly in com- 
plex systems, components that tested “goad” in isolation will not perform 
correctly in the system. In such cases, additional devices, to he described, 
may be required. 


Binary Splits 

Some thought will reveal that the progressive method of signal tracing 
or signal substitution just described is not the most efficient procedure 
possible. We could reduce the number of probe points or insertion points 
by strategic selection of test points, rather than by progressive selection. 

We can achieve an unproved selection by logically cutting the system 
in half at each successive probe insertion (ot substitution). By a logical or 
binary split, we mean a choice of probe points that will divide the system 
into two parts at each stage, such that the probability of finding the defect 
in either part is onc-half, or as near to that fraction as possible. 

Example Continuing with the four-light-bulhs problem, again assume 
senes connection, so that all bulbs must work for the string to light. 
Assume that we know only one bulb is defective, but wc do not know 
its location. Then, after applying a test voltage across the string, the first 
point for binary detection vs between the second and third bulb, under 
the assumption that all positions for the defective bulb are equiprobable. 
With only a single bulb defective, this probe will isolate it as being 
either bulb i or 2, or bulb 3 or 4. Next, by using one more probe between 
bulbs 1 and % or between 3 and 4, we can isolate the faulty bulb. This 
procedure thus requires a maximum of two steps, rather than the 
maximum of three required by ibe sequential procedure previously 
descnlwd. (See Figure I0-2d. } It is interesting to compare the mass- 
mum number and the average number of steps needed by the output- 
to-input and binary methods to isolate a fault in a system of N com' 
ponents connected in series, with the fault's position equiprobable in 
each of N locations. 
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Example; For illustration, consider N = 4. Then both the maximum 
number and average number of probes using the binary selection method 
will be log, 4=2. The maximum number of probes using the sequential 
method will be three; the average number of probes will be 
1(1 + 2 + 3 + 3) = x = 2.25. Thus, for this case, the binary selection 
method is better in reducing both the maximum number and the average 
number of probes necessary for isolation. The disparity in favor of the 
binary' method greatly increases as N increases. For example, if W is a 
power of 2, then both the maximum number and the average number 
of probes using the binary selection method will be log, N (or at most 
the next integer larger than log, N, if N is not a power of 2). However, 
the maximum number of probes for the sequential approach is ( N — 1), 
and the average number of probes is, using the sum of an arithmetic 
series, 1 


both of which increase much faster than log, N with N. 



Binary 

Sequential 

N 

Maximum 

Average 

Maximum 

Average 

2 

1 

1 

1 

1.00 

4 

2 

2 

3 

2.25 

8 

3 

3 

7 

3.38 

16 

4 

4 

15 

7.96 


The binary splitting approach is thus preferred with N large, when 
nothing is known about the faults location (i.e., all possibilities are 
equiprobable), and the system can be halved at each testing stage. The 
same argument holds when a single fault may be expected with known 
probability at one of N locations. In this case, the system is split into 
parts having equal probability of containing the defective item, or as 
near to this desired split as is physically possible. 

Example: Consider four components in series, one of which is known to 
be defective. The probability of failure for each component in the string 
is given (from past experience) as 

The sum of the arithmetic senes 1+2+3-f +tf = N(N + l)/2 In the prob- 

lem above N — 1 sequential tests are made, and it is assumed that the last probe 
either located defective bulb (in which case the Nth is good, if there is only one 
defective) or not (in which case the last bulb is defective). Thus, a maximum of N — I 
probes (not N) may be required for sequential isolation of a single defective. To 
compute the average number of probes, the (N — l)st probe is counted twice, as illus- 
trated for the case of N = 4 above. 
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Probability of a fault (fot this system. 

Component assuming a single defective) 

Q.4 
0.3 
0.2 
0. 1 
i.0 

The first probe choice that will most nearly divide this system into parts 
basing equal probability of containing the single defective is between 
components 1 and 2 (assuming the components are connected in the 
string 1-2-34.) This choice divides the system into part No. 1 (containing 
component 1 with a probability 0.4 of being defective) and part No. 2 
( containing components 2, 3, and 4 with a probability of 0.6 of containing 
the defective). This probe isolates the defective either in part No. 1 (in 
which case the search is finished) or m part No. 2 (in which case the 
search must continue) If the search continues, part No. 2 is probed be- 
tween components 2 and 3, because this ghes an equiprobable split for 
(he remaining system. Again, this second probe isolates the defective 
either as component 2 (and the search is complete) or as lying in the set 
of components 3 and 4, which requires a third probe (between com- 
ponents 3 and 4 ) for final isolation. In this case, the actual number of 
probes required wiff depend upon the location of the defective com- 
ponent, and may be one, two, or three, as can be confirmed by enumera- 
tion of the four possible conditions (component 1, 2, 3, ot 4 defective). 
The maximum number that may be required, three, is greater than log* 
4 = 2, because a perfect binary split was not possible at each testing 
stage. From the table below, it is easy to compute the average number of 
probes required as 1.9. 

Component 
12 3 4 


(1) (2) Expected 

Probability tiefec- Probes required Prob- value = 

fiLC fn «lnng 0.4 03 0.2 01 to isolate ability (1) X (2) 

Fust bad X 0 0 0 1 0,4 0.4 

Second bad 9 X 0 0 2 Oj 0.6 

Third bad 0 0 X 0 3 0.2 06 

Fourth bad 0 0 0 X 3 01 0.3 


Expected Number of probes = 1.9 
The expected number of probes (1.9) is less than log, 4 ^ 2 because the 
probability of finding (he fault is clustered at one end of the string. In- 
deed, in this case, the binary split procedure is equivalent to the sequen- 


1 

2 

3 

4 
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tial search procedure that starts between components 1 and 2, although 
this would not generally be the case. 

Consider the alternative series arrangement in which the string is 
composed of components in the order 4- 1-2-3. Then, upon rearrangement 
of the probability table above, we would find that the maximum number 
and average number of probes needed for isolation would be two, and 
that the binary approach is superior to the sequential approach, as is 
usually the case for single fault detection. 

These examples illustrate that although the binary search procedure may 
be logically preferred as an optimum method under the stated assumptions, 
it may not be possible or desirable to implement the procedure in every 
case. For example, it may not be possible to find the logically required 
probe points, or these points may be difficult to reach physically, so that 
another method is desirable. The probability of a single failure may not be 
known, or if known, it may not be possible physically to split the system 
into equiprobable halves at each stage. Moreover, if many of the N com- 
ponents in a system are defective at the same time, the power of the binary 
splitting approach over the sequential signal tracing approach diminishes, 
as may be seen by considering the four-component system with three of 
the four components defective. Finally, in complex systems with multiple 
failures, the binary search procedure may be more cumbersome to organize 
and to control than a somewhat longer, yet more obvious progressive signal- 
tracing scheme. 

As a result of these observations, many probe schemes strike a com- 
promise between what can be gained by the binary approach, and what is 
physically possible. Often complex systems will have natural junctions, 
which can be used to split the system into segments. If available, these 
points are used for isolation tests, and good system design practice always 
calls for the inclusion of such isolation points between major subsystems. 

intermittent Failures 

Thus far, we have assumed that a fault, if present, will remain present 
during the course of the isolation search. Unfortunately, this convenient 
state of affairs is not always realized; the fault may be intermittent and thus 
it may or may not be found at a given search step. 

In such situations, which, in terms of time and effort, are by far the most 
troublesome for the investigator, the isolation procedures just described are 
still useful. However, the application of a probe at a given test point must 
cover a sufficient period of time to include at least one observation of the 
intermittent fault. For this purpose, it is often desirable to use multiple and 
simultaneous probe measurements, rather than a detailed series of measure- 
ments. 
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Probability of a fault (for this system, 

Component assuming a si ngle defective) 

1 04 

2 05 

3 0.2 

4 2i 

1.0 

The first probe choice that will most nearly divide this system into parts 
having equal probability of containing the single defective is between 
components l and 2 (assuming the components are connected in the 
string 1 -2-3-4 ) This choice divides the system into part No. 1 (containing 
component 1 with a probability 0.4 of being defective) and part No. 2 
( containing components 2, 3, and 4 with a probability of 0.6 of containing 
the defective). This probe isolates the defective either in part No. 1 (in 
which case the search is finished) or m part No. 2 (in which case the 
search must continue ) If the search continues, part No. 2 is probed be- 
tween components 2 and 3, because this gives an equiprobable split for 
the remaining system. Again, this second probe isolates the defective 
either as component 2 ( and the search is complete ) or as lying in the set 
of components 3 and 4, which requires a third probe (between com- 
ponents 3 and 4 ) for final isolation. In this case, the actual number of 
probes required will depend upon the location of the defective com- 
ponent, and may be one, two, or three, as can be confirmed by enumera- 
tion of the four possible conditions (component 1, 2, 3, or 4 defective). 
The maximum number that may be required, three, is greater than log- 
4 = 2, because a perfect binary split was not possible at each testing 
stage. From the table below, it is easy to compute the average number of 
probes required as 1.9. 


Component 
12 3 4 


Probability defec- 
tive in *tnng 

0.4 

0.3 

0.2 

01 

(1) 

Probes required 
to isolate 

(2) 

Prob- 

ability' 

Erpected 
value = 
(1) X (2) 

First bad 

X 

0 

0 

0 

3 

0.4 

0.4 

Second bad 

0 

X 

0 

0 


0.3 

0.8 

Third bad 

0 

0 

X 

0 

3 

0.2 

0.6 

Fourth bad 

0 

0 

0 

X 

3 

0.1 

0.3 


Expected Number of probes = 1.9 
The expected number of probes (1.9) is less than log. 4 = 2 because the 
probability of finding the fault is clustered at one end of the string. In- 
deed, in this case, the binary split procedure is equivalent to the sequen- 
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tial search procedure that starts between components 1 and 2, although 
this would not generally be the case. 

Consider the alternative series arrangement in which the string is 
composed of components in the order 4-1-2-3. Then, upon rearrangement 
of the probability table above, we would find that the maximum number 
and average number of probes needed for isolation would be two, and 
that the binary approach is superior to the sequential approach, as is 
usually the case for single fault detection. 

These examples illustrate that although the binary search procedure may 
be logically preferred as an optimum method under the stated assumptions, 
it may not be possible or desirable to implement the procedure in every 
case. For example, it may not be possible to find the logically required 
probe points, or these points may be difficult to reach physically, so that 
another method is desirable. The probability of a single failure may not be 
known, or if known, it may not be possible physically to split the system 
into equiprobable halves at each stage. Moreover, if many of the N com- 
ponents in a system are defective at the same time, the power of the binary 
splitting approach over the sequential signal tracing approach diminishes, 
as may be seen by considering the four-component system with three of 
the four components defective. Finally, in complex systems with multiple 
failures, the binary search procedure may be more cumbersome to organize 
and to control than a somewhat longer, yet more obvious progressive signal- 
tracing scheme. 

As a result of these observations, many probe schemes strike a com- 
promise between what can be gained by the binary approach, and what is 
physically possible. Often complex systems will have natural junctions, 
which can be used to split the system into segments. If available, these 
points are used for isolation tests, and good system design practice always 
calls for the inclusion of such isolation points between major subsystems. 

Intermittent Failures 

Thus far, we have assumed that a fault, if present, will remain present 
during the course of the isolation search. Unfortunately, this convenient 
state of affairs is not always realized; the fault may be intermittent and thus 
it may or may not be found at a given search step. 

In such situations, which, in terms of time and effort, are by far the most 
troublesome for the investigator, the isolation procedures just described are 
still useful. However, the application of a probe at a given test point must 
cover a sufficient period of time to include at least one observation of the 
intermittent fault. F 0 r this purpose, it is often desirable to use multiple and 
simultaneous probe measurements, rather than a detailed series of measure- 
ments. 
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For example, suppose a system consists of N components in series, as 
before, and that the system shows an intermittent failure pattern which may 
lie in any one of the N components with equal probability. To isolate the 
intermittent failure, we again apply a standard test signal to the input of 
the system, but we now continuously monitor both the output of the system 
(which is correct most of the time) and a test point at the -middle" of the 
system or as near this point as is physically possible. As the monitoring 
proceeds, the intermittent fault appears in the output of the system. Two 
outcomes are possible for the midpoint probe. Either the intermittent 
difficulty will show up there or it will not. If it does show up, we know 
that the Erst half of the system is at fault, and a new test point nearer the 
input to the system is selected for continued monitoring. If not, we move 
the test point nearer the output of the system. The procedure continues until 
the source of the intermittent failure has been found. (If multiple sources 
are suspected, a progressive sequence of probe points from input to output 
may be preferred, as before ) i 1 
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Stress may be applied to the system as a whole or to its suspect sub- 
systems. In hardware systems, stress is often applied by variation of the 
supplies required for system operation (stuffing or starving the system), 
by extreme variation of environmental conditions (temperature, humidity, 
pressure, vibration), or by requiring extremes of input and output (alter- 
nate overloading and “coasting”). Similar forms of stress are often applied 
to human organizations for similar reasons. 3 Thus, automobile torture 
testing, military survival exercises, and marginal testing of electronic 
components are aimed at the isolation of components, individuals, or 
subsystems that are likely to fail or to show unwanted symptoms in the 
future or that presently show such signs of failure rarely or intermittently. 
The objective is to convert the improbable event into the highly probable 
event on an experimental basis. 

Clearly, the amount of stress applied to a system or to its parts must 
be calculated with some care; otherwise permanent system damage of 
an unwanted type may occur. For example, although it is desirable to 
locate a weak component in a system, it is not desirable to so stress 
the system as a whole that all its components are permanently ruined. 
We may stress a mechanical or electrical system by vibrating it strenu- 
ously, but we do not usually drop it from a 100-foot cliff and expect any 
diagnostically useful results. 

In addition to stress methods often being used for their own sake in 
"sensitivity analyses" of various forms (to evaluate the stability of a sys- 
tem's transformation properties under duress), they may also be used 
in conjunction with the continuous probe point monitoring described in 
the previous section. Thus, if a system contains a suspect component, this 
component may be subjected to stress while the probe points are monitored. 

If no specific suspect is available, stress applied to the whole system 
while the probes are monitored may accentuate the intermittent failure 
or increase the frequency of its occurrence. For example, components of 
an intermittent hi-fi amplifier may be subjected to alternate heat and 
cold (by application of a soldering iron and pinpoint sprays of liquified 
refrigerant gas under pressure), while one or more probe points are moni- 
tored against a standard specification- 

It is interesting to note here that systems designs often create “stress 
points” inadvertently, and that if these points are known or can be 
isolated in the design, potential trouble sources can be forestalled. Field 

» For example, see Problem 13.1, page 439. Stress methods are often applied in personnel 
interviews, in group and individual survival experiments, and in organizational evalua- 
tion by purchasing agents and prospective customers. This form of testing is also found 
in the folklore of most societies. Remember the many young men who completed “im- 
possible tasks” to win the hand of a princess. The extermination of dragons, heroic 
wood-chopping, the location of rare prizes, the placation of family rivalries, and similar 
hurdles have been the substance of stories from Homer to Horatio Alger. 
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tests and torture tests of new products have this objective. Similarly, 
analytical and laboratory techniques are often planned for the same ends, 
as, for example, in mechanical stress analysis in structural design problems. 

An analogy is of interest to the administrator of human organizations. 4 
If the work Sow in an operation has been analyzed, so that its organiza- 
tion of cooperative endeavors is known, it may be possible to Bnd stress 
points created by the organizational design. 

For example, in a senes of operations requiring cooperation, as in a 
system with division of labor, each subgroup attempts to develop a 
stable pattern of interaction, or work, both within the subgroup and 
between die other subgroups in the system. When these stable patterns 
teak down or cannot be built up, the individuals involved experience 
stress or an uncomfortable feeling of pressure and dissatisfaction. “A 
breakdown in the flow creates opposition as the individuals struggle to 
restore it The expected responses from the individuals in the sequence 
prove inadequate, and new coordinate problems arise.” ‘ 

^ P ° i ? S " heiC tKe " 0rV fimv cann °t be stabilized and con* 
amtor 5 SK > u ; n “ of 1°>* reqmred tot the completion of 

ooints m, h 15 r r lhe tontre ' °< a stogie supervisor), stress 
RffaL « 7 b "" Cm "" " 0lk « '!>= superiors of 

nt* ^ev r™ot gr0 l' £ ’ ! r arc 7'" plras ,0 mM "'"to organizational 

z%£z,2i°\; t^i%T;r a r o! r staglc manaser ' 

tween the functirmc f * j * °* ^ em ' ^>T lca ^ conflicts arise be- 
end produ^I Lh ’ production - fiance, betw een maintenance 
Ztl t J 7 roa ' erial handli "8 a " d tospection. The more 
TIL more let TT™ ^ be to ™"Pl« a a S^en lash, 
and after it, the greater then 7 ”7 8 ' a ', itS arti °ns to those before 
is coordinated to the given tasV.*'^ ° * T ° uWe ' un,ess ,he lota l system 

changes in fbc hUeraMioT oTSle'ind-ld ^ ,a ’ e ° f fl °' V a " d si S n ’ ficant 
Of organizational stress." 1 ividuats concerned indicate a point 

tional stress may be'Tnduced^for 7-^-^ 7°“’, d be clear that ' ,t g ani2a - 


operating capacity, time, and resouTT Tr"*' ■ pl, T wse ! b y restricting 
organizational stress noin!< . a S"'™ worlt load. Usually 

'ress pomts become clear a! such restrictions are applied. 

Stock Substitution 

““rr” — - — w — 

• IW?P. 37 re ’ ak ™at. Macmillaa.'S Ors.nl- 
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analysis and systems treatment can be intertwined or accomplished at the 
same time. 


For several reasons, it may be desirable to locate and to correct a fault 
in one step by substituting a new system component or module for a 
suspected defective. 

First, this procedure may be easily and swiftly used in systems con- 
structed on a modular principle, as are most modem systems designs. 
Either binary or progressive substitution may be used, depending upon 
the modular construction and the number of modules presumed defective. 
Thus, if a system can be divided into two parts, A and B, and if the cor- 
responding spare modules are available, direct substitution can isolate 
the defective part. Subsystem A or subsystem B, or both subsystem A and 
subsystem B must be defective. The three possible forms of substitution, 
using the spare subsystem, will not only isolate the fault, but also correct 
it in the process. Although this procedure is fast, it requires the avail- 
ability and substitution of large modules, which, in extreme circumstances 
(both subsystem A and subsystem B bad), could result in total system 
substitution. (Duplicate facilities are often maintained on standby for 
just this purpose.) 

The same procedure would be possible with systems divided into 
more numerous modules, if sufficient spares were available and if 'the 
effort required in the substitution process were minimal compared to 


that required in extensive diagnostic tests. 

For example, returning again to the four light bulbs connected in 
series, suppose the string failed to light as before. Then, if sufficient 
spares were available, we could first replace half the bulbs wit new 
ones, say bulbs 1 and 2. If the string then lights, we could replace one 
of the new bulbs with one of the old bulbs, say in position 1. If the string 
lights with the old bulb, then the remaining old bulb is at fault, it tn 
string does not light with the substituted old bulb, then t at u is 
fault. (A similar sequence would be followed if the string i no 
with the substitution of bulbs 1 and 2 at the first stage. If so, and it 
had sufficient new components, we could renew bulbs an , 
resubstitute old bulbs until the defective or defectives turn up. 
procedure works best when the probability of multiple ai ures is 

A substitution procedure with exactly the same logic may 
by starting with two identical systems, one operative an e ^ 

operative, or equivalently by complete substitution of a defective 

inoperative system to make it work at the outset, o n mo d u ] es 
module, the modules from the inoperative system (or the ° a ^ 
taken from the “rebuilt” system) are resubstituted on a bi 
progressive basis until the working system fa, J moduIes 
of resubstitution is continued until the single , .. » j 

have been isolated. This approach has some appeal when .t can be used, 
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because the storking system provides evidence that the "new" modules 
are all in working order at the tunc of substitution. 

The reader will realize that the block substitution procedures Ju5t 
described are essentially different ways of implementing the signal-tracing 
and signal-substitution methods already discussed. For example, the sub- 
stitution of a “good" system from the input to a given point applies a 
correct input to that point, and is equivalent to synthetic signal substitution. 
Conversely, substitution of a “good” system from a given point to the 
system output is equivalent to probe insertion at that point, as required 
for signal tracing. The approach chosen is thus a matter of physical 
convenience. 

In this section we have considered a number of fault-search methods 
widely used in hardware systems for trouble-shooting. These methods, 
however, are of general use, and, as an exercise, the reader should attempt 
to cite examples of the general procedures in specific fields. Under what 
conditions, for example, can the signal tracing procedure be successful? 
When is block substitution preferred to signal tracing? What additional 
information might improve the trouble-shooting sequence? How can these 
data be organized? How could these trouble-shooting procedures be 
automated? Consideration of such questions will considerably improve 
the reader s appreciation of the approaches cited. 

It is also worthwhile to note that even rudimentary' training in the 
suggested procedures often leads to great improvement in the trouble- 
shooting ability of maintenance and “special project” personnel. 


INFORMATION SEARCH PROCEDURE 

Information retrieval is another example of the search problem in sys- 
tems analysis. In this section, we present a brief survey of the procedures 
used, and note that the forms of search indicated may he expressed also 
in the Boolean Algebra format of Chapter 11. 


The Jfem-Cfiaracterfslfc File 

Information file, are composed of a list of Hem, (such as book, articles, 
MO parcels, aod individual names) that have certain characteristic! 
(such as topics covered, key svords, attributes, and skills),. 

kev ,°.e,a Ia T P l C ' u ““““"S ° f "'Ehl documents described by six 
" c cither appear or do not appear) represents a simpli- 

' X wS? SSS’SXS' "’£>*?«<» rf ties. See R- S. Ledlcy, Fr,^- 

''..k, 1962, chapter II, 
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fied file. For simplicity, the documents wall be numbered (coded) 1-8, 
and the key words will be coded A-F. (The codes used may be structured 
to have meaning, as indicated in Appendix C and the reference below.) 
The associations of characteristics with items, i,e., key words with docu- 
ments in this file, may be displayed as in Figure 10-3, which is a term- 
document or characteristic-item matrix for the hypothetical eight docu- 
ments and six key words. A “1" in a given row indicates that that document 
can be described by the indicated key word. Similarly, a "l" in a column 
indicates that that term describes the indicated document. 


Item Retrieval 

Documents having the desired key words (or items having the desired 
characteristics) may be retrieved or extracted from the file in various 
ways. 

If we can operate on the rows or columns of the term-document 
matrix C of Figure 10-3, which may be possible if the matrix is not 
too large, we can find the documents directly by two procedures, which 
we will illustrate by example. 

Example: Suppose that documents containing both key words A and 
B are desired. Then if the elements of columns A and B of Matrix C 
are multiplied together cell by cell using logical multiplication (1*1 = 1, 
all other combinations equal 0), the required documents will be 
indicated by the l’s that remain in the calculated result. Thus, 


Documents ^ 

A 

B 

c 

D 

E 

Key words 

F 

1 

1 

0 

0 

1 

0 

0 

2 

1 

1 

0 

1 

0 

0 

3 

0 

0 

1 

1 

1 

1 

4 

0 

l 

1 

0 

0 

1 = Term-document matrix C 

5 

, 1 

0 

0 

1 

1 

0 

6 

1 0 

0 

1 

0 

1 

0 

7 

1 0 

1 

0 

1 

0 

1 

8 

1 

1 

1 

0 

0 

0 


10-3 A typical term-document matrix C. In this format, documents are coded 
numerically and key words alphabetically. Each row refers to a document and its as- 
sociated key words. Each column refers to a key word and its associated documents. 
(It is equally possible to work with the transpose of this matrix by the Interchange of 
row and column operations described in the text} 
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and documents 2 and 8 arc the ones desired. (In Chapter Il» this 
form of manipulation will be expanded to include other forms of 
logical combination of characteristics, which will then be called logical 
statements having the value True or False.) The procedure provides 
a direct matching of key words with documents. 

Example. .\n other simple procedure that provides a count of the 
number of matches between the desired key words and those present 
for each document uses ordinary matrix multiplication by a column 
vector composed of G‘s and l’s to indicate the desired key words. For 
example, if documents having key words B, C, and F were desired 
we could form the column vector q, for key words, which has l's in 
positions 0, C, and F and 0’s elsewhere as shown below. Multiplication of 
the term-document matrix, C, by q, m that order, produces the column 
vector r, which describes the number of matches for each document 


WxW-lr) 

Key word Document 



In this example, document 4 matches the three hey words speci- 
Scd and is uniquely selected. However, other documents match one 
ot two of the specified key- words, and may therefore be of some 
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interest. In efficient retrieval systems in which the keywords are 
precisely applied, documents which have the highest value in the 
computed column vector r will be the most relevant. (In practice more 
complicated computation forms are often used to give a relevance 
score for each document in the set, as discussed briefly hereafter and 
extensively by Salton. 9 ) 

Three File Search Methods 

Three forms of search are often used in extensive files, namely, (1) 
search by item for given characteristics, (2) search by characteristic 
for items so described, and (3) search by short tables of characteristic 
with cross references. Each of these methods which we will illustrate 
by using the term -document matrix data of Figure 10-3, is suitable for 
computer use. 

Example: A file may be organized by listing items as a heading, and 
sublisting the characteristics. This produces an item-characteristic file. 
A search on the item, or document in this case, can be made down the 
document list. If the desired key words are present for a given document, 
that document number is extracted. Such a listing of documents need not 
be in any particular order, but the search for all relevant documents 
must encompass the entire list. Physically, such a file organization and 
search procedure corresponds to the search for items by examination of 
every item on a computer tape, or, in the manual file, to the search for 
a sub-set of desired items from a set of edge-punched (McBee) cards. 
(See Figure 10-4a.) 

Example: Conversely, a file may be organized by listing characteristics 
as a heading and sublisting items having that characteristic. Again, 
with our previous definition, this produces a characteristic-item file. In 
this file, the characteristics (key words) must appear in a given order. 
Here we assume an alphabetical order of key-word codes for efficiency. 

To find the required documents, we first place the specified key words 
in alphabetical order. Then, we consult the first specified key word and 
note the relevant documents. Next, we consult the second key word 
and retain the relevant documents for this key word (if any) that also 
match the relevant documents for the first key word. This procedure 
continues until all the specified key words have been consulted, and the 
common relevant documents are isolated. This procedure has the advan- 
tage that only the specified key-word categories need be consulted, not 
the entire list. However, a strict ordering of the key words in the file is 
required so that they can be found easily. Physically, this method corre- 
sponds to the manual use of "peekaboo” cards, illustrated in Figure l0-4b. 

• See G. Salton, ‘Progress in Automatic Information Retrieval," IEEE Spectrum, August, 
1965, pp. 9£U103. This review article contains an extensive bibliography. 
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16-4 Edge-punched Hem cards and peeK-a-boo characteristic cards. In (a), one card 
is made tor each item (document) in a Tile, and the characteristic (key word) is punched 
at specified edge positions Items 1 and 2 ot Matrix C, Figure 10-3, are shown. To 
extract all Items with a given characteristic, a needle is inserted In the edge position 
(hole or notch) tor that characteristic (0 is shown). All items with that characteristic 
tall out ot the file In (6), one card is made tor each characteristic, and items having 
that characteristic are indicated by punching out a hole In a designated item position. 
Characteristics A and 0 ot Figure 10-3 are shown. To find items with given characteris- 
tics, the desired characteristic cards are aligned. Light will pass through the item 
holes that have all the specified characteristics, here documents 2 and 8. 


Example: A third form of file organization and search (known as the 
Tabledex Method) is a tabular combination of the previous procedures, 
which drastically reduces search time at the expense of file complexity. 
The file is organized by creating a table for each characteristic, or key 
word in the present case, and listing in order as subheadings the items 
(documents) described by the given key word. So far the organization is 
similar to the previous example. However, we now expand the table by 
listing for each document shown all of the pertinent characteristics 
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greater m order than the characteristic, or key-word code, of the table 
at hand. To accomplish this we use the alphabetical ordering of key 
words and numerical ordering of documents shown in Figure 10-3 for 
the term-document matrix C. Figure 10-5 shows the complex Tabledex 
file organization for the data of Figure 10-3. Note that the characteristic 
tables are arranged in alphabetical order, that the documents in each 
table are in strict numerical order (where they appear), and that the 
characteristics for each document so listed in a table are in alphabetical 
order and contain entries of higher alphabetical order than the table 
designation (again, where they appear). 

To search for documents described by a given set of key words (or 
items described by a given set of characteristics) we first pot the desired 
characteristics in ascending (alphabetical) order. We then scan the 
Tabledex file for the table corresponding to the lowest order of the 
desired characteristic. We next scan the document list in this table for 
documents (if any) showing the remaining key words. If we find such 
documents, they are the desired ones, and our search ends. We need 
only consult one table in this method of search, 

For example, suppose we wanted the documents described by key 
words B and C. Because these key words are in order, we consult the 


Documents 

Key words 

Documents 

Key words 

Documents 

Key words 

1 

D 

2 

D 

3 

D,E,F 

2 

B,D 

4 

C.F 

4 

F 

5 

D,E 

7 

D,F 

6 

E 

8 

B,C 

8 

C 

8 

(None) 

Table D 

Table E 

Table F 

Documents 

Key Words 

Documents 

Key words 

Documents 

Key words 

1 

(None) 

3 

F 

3 

(None) 

2 

(None) 

5 

(None) 

4 

(None) 

3 

E,F 

6 

(None) 

7 

(None) 


10-5 A tabledex We organization for the term-document data of Figure 10-3. One 
table is prepared for each k e y word (coded alphabetically here). The tables are 
ordered alphabetically (by the code) as shown from left to right down the page above. 
To find a document associated with a given set of key words, the keywords are first 
ordered by their code. Then the table for the first keyword is found. Documents de- 
scribed by the other key words specified (if any) will appear in that table and may be 
• - - ^ i^wiinn without recourse to other Tables In the file. 
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TabWex File and find Table B. The documents described by B are listed. 
We next scan the key-word lasting (or each document shown containing 
C The only documents satisfying the request are documents 4 and S. 

A manual Tabledex file may be constructed as a book of pages, watt 
one page for each keyword table, and the remaining detail printed on 
the page (A computer can be programed to produce such a book if 
desired.) 'Hien, to locate a given document, we turn to the page repre- 
senting the lowest order key word, and End the request {if it exists) on 
that page, by following the above method. 


Searching with Relaxed Conditions 

With any of the above procedures, the search for a given item ( or docu- 
ment) may possibly not lead to any items having the specified characteris- 
tics (or key words) if the characteristics required are too numerous. In such 
a case, the number of requirements must be reduced to find some items 
that are reasonably desirable. 

This relaxation of conditions raises questions about which of the charac- 
teristics to discard and which to retain in a further search. First, a combina- 
torial problem anses. For example, if, say, five key words are used in the 
search for a document and the search fails, ue might think of using only 
four key words, and then repeating the search. However, the number of 
ways four key words can be selected from the original five is five, the number 
of ways three keywords can be selected from five is ten, the number of ways 
two key words can be selected from five is ten, and so on.™ We may bave to 
perform a series of searches of this type before finding any relevant docu- 
ments. The situation worsens severely as the original number of desired 
key words increases. 

Fortunately, several methods combat this combinatorial difficulty. For 
example, the search procedure described on page 274 will give the number 
of matches that do occur for the specified key words, and therefore serves 
to rank the documents in the order of relevance (by the number of matches) 
even though no perfect matching document can be found. 

Example. Suppose, using the data of the term-document matrix [Cj of 
Figure 10-3, we desired to find documents described by key words A, B, 
C, and D. \\ ithout knowledge of what is in the file, we may thus form 
the column request vector q, and multiplication of the term-document 
matrix C by the request vector q (in that order) will produce a column 
vector r which, in effect, evaluates the documents in the file in terms of 
the request. Thus, for the four key words suggested: 

* Brcall that, to grueral, the number of ways r things can be selected from n is 

«7{<r!)(n-r)tJ. 
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l C] X [q] = [r] 

Key word Document 



The results show that no document is described by all four specified 
key words (because there are not four matches), but that two docu- 
ments show three matches (documents 2 and 8), and several show two 
matches. One order of preference would be to inspect the documents 
in the order indicated by the number of matches. 11 

The concept just illustrated is easily extended to the case in which some of 
the key words are more important in the search than others. In such a 
case, the original request vector q may show weighted values for each de- 
sired key word (instead of the binary indication of the previous example), 
and the process can be repeated. For example, suppose we still want key 
words A, B, C, and D, but that the key word C is thought twice as important 
as the other key words. Then we have 


Key word 

[Cl X A 
B 
C 
D 
E 
F 


Weighted 

Request 



Document 

1 

2 

3 

4 

5 

6 

7 

8 


Weighted 

Matches 



Ledley op. cit., pp. 506-510, indicates an alternate sorting procedure to achieve the 
same result. 
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which now prefers document 4. Although such weights may be arbitrary, 
depending upon the investigators choice it is abo possible to generate 
weights based on the possible correlation of key words and documents in 
the file, and such methods are often used in automated retrieval informa- 
tion systems. 11 

Finally, there may be forms of search in which certain key words (or 
characteristics) must be satisfied, and other key words, although desirable, 
axe not necessary. In such cases, the search can proceed by first partitioning 
the file (and therefore the corresponding term-document matrix) by sorting 
out the documents with the mandator)’ key words, and then continuing as 
before. 

Example: Suppose that for the file described by matrix C of Figure 10-3 
we wish to find documents described by key words A, B, and C, but that 
term A is mandatory. Then, by a sort on key word A (and deletion of 
key words D, E, and F, which are now irrelevant to the search), we 
have a reduced term-document matrix C NEW> which may now be searched 
for key w ords B and C as before. 

ICm*} X taaewl ** l r n*wl 

Document Key word Key word Document Matches 

B C 


I 

1 O' 

x b m = i 

V 

2 

1 1 

C [ij 2 

o 

5 

1 0 

O 

1 

8 

.1 1_ 

8 

,2, 


The new term-document matrix contains only rows 1, 2, 5, and 8, because 
these are the only documents in the file hasing the mandatory key word A ; 
it contains only columns B and C, because these are the only pertinent 
key words remaining for the search. The result shows that documents 
2 and fi meet all three requirements, because we know the docu- 
ments have key word A (by virtue of the original sort) and because two 
matches indicate that both key words B and C have been matched.’* 


Indexing 


In the preceding, we hate assumed that the items or documents filed have 
associated urtli them definite characteristics which arc clearly recognized 


° See Sdrt cm, op. Cit , pp. (U-fto 

” I! Jf f by v « t « ““Wpkcatioa used in the examples above 

I'.TSrSS **««**». the number of 
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and which can be attributed to a given item when the file is made up. For 
example, suppose a file shows product assemblies as items, and the attributes 
of these assemblies are their speciSc parts. Then the relationship of part to 
assembly i s clearly defined by product specification, or blueprint, and no 
ambiguity should occur in the assignment of “characteristics” to items. A 
similar situation obtains in highly structured information systems, such as 
the chart of accounts: The characteristics that should cause a given cost to 
be posted to a given account are defined, so that the association defined by 
the item characteristic table is clear. 

More generally, however, the job of assigning key words to documents, 
or characteristics to items, which is called indexing, is not so clear-cut. For 
example, suppose we have a technical article that refers to (1) aircraft, and 
(2) manufacturing, and that these key words are assigned to that document. 
The article in question will not be found if a request is made for all articles 
described by the key words (1) airplane and (2) production. This problem 
is distinctly different from the cases described above for the assembly file 
and the cost accounting file, where the specification of a few ngidly pre- 
scribed terms will locate a unique item, or set of items. In the more general 
information retrieval problem, a given item or document can be described 
in an infinite variety of ways and shades of meaning which places an un- 
realistic burden on the indexing, processing, and request operations. For the 
rigidly indexed file, the terms used by the indexer and the requester must 
be complete and absolutely consistent to provide the desired result. 

A number of devices are used to alleviate this somewhat unrealistic 
requirement. Some intermediate device or step is required to overcome 
inconsistencies and incompleteness which often arise as the popularity of 
descriptive terms changes, as personnel changes, and as the scope of the 
master file increases. 


Dictionaries 

Acceptable term dictionaries are often provided, so that the indexer and 
the requester can use similar words in the input and output operations. Fre- 
quently, this approach is expanded to include a thesaurus of terms to which 
a common code may be applied- For example, aircraft, airplane, and other 
synonyms for the same type of flight equipment may be coded to a single 
identifier, say subject 3.16, which is thereafter used as the indexing and 
search code for filing and retrieval. Similarly, concepts and operations may 
also be given a code number, and a dictionary or thesaurus of concepts 
arranged in a hierarchy of generality may be developed, so that a given 
specification can be related to more or less general specifications, as the 
need arises. Again, the same idea may be used to develop phrase dictio- 
naries in which, say, the terms “cost” and “accounting” are combined into the 
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more useful descriptive term “cost accounting" to be coded and used for 
index and search. 


Such devices are usuall) employed in automated information retrieval 
systems, so that minor sanations in indexing and request formats can be 
handled Another approach, also used in automated information systems, is 
to compute the statistical correlation between terms used in the file, and 
between items that have similar descriptions. This can be accomplished for 
a given file, and revised by rccomputation as the contents of the file 
changes. The computation may also be arranged in stages to suggest alter- 
nate terms for search, or to provide an automatic weighting for the terms 
used in a request and for those terms generated automatically from the 
request 

Tot example, m a given file w e may find tliat the term “aircraft" appears 
frequently With, and therefore is highly correlated with, such terms as 
airframe, fusiUge," “propeller," “acceleration." “jet," and “supersonic.” 
Hmvcvcr die terms associated with “aircraft" may also be associated in the 
y *?’ >'*** may bc thou S ht of ■* a second-order 
cmerJr ^ ”” >SH conU S UOUs "iodising terms for one more 

‘ lead t0 one or more lynwym* for it as those 

synonyms are emplojed in the file itself. By this procedure the term “air- 

have ' ve 

search lor cause of spta dSt I ” ” y appl, “" ons ' '"“”8 &<= 
symptoms. ty ' we are S ,v ®** a set of observed 


SOME RANDOM SEARCH PROBLEMS 

isolating one or more'spectfic' " h ’ Ch ’ ahhou 8 h ‘hey involve 
infinite, population, a'c^f* ^ -* > ^ perhaps 

random sampling. ^ logical sequencing but by 

alternate points promt, a'S™ in\h'ch a™ 'l 0 ™ P “ !ihle 
may l>c useful. Although [t uouM „^ a ntndom search procedure 

method has many appeals. If f or ? Cm t0 at first > a random search 

is chosen, and the desired item is not awi S en “? c seaic ^ procedure 
specific procedure wed, the item «jh nol tbe choice of *e 
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Example: Suppose a particular item belongs to a set of odd-numbered 
categories. If the investigator selects an examination or testing proce- 
dure which considers only even-numbered items, that procedure is 
doomed to failure. It will not find the desired item, and this failure is 
a property of the chosen search method. 

Similarly, a random search procedure is, on the average, one which can- 
not be defeated by opponent strategy'. 

Example: Suppose the police search for crimes in progress by following 
a systematic surveillance route: The radio car makes a systematic tour 
of the community at specified hours. If this search pattern becomes 
known to criminals in the community, they will surely take advantage 
of the systematic pattern and commit their crimes when it calls for 
nonsurveillance. The same argument holds for both space and time. 
And so, many police forces make random tours of surveillance. 

In addition, the use of a random search procedure often gives the 
investigator a limiting condition for item isolation, which can be used as a 
standard against which other search methods can be compared. 

Example: Any knowledge of the probability distribution of an items 
location improves the probability of locating it. But it may be desirable 
to determine how much this information is worth. By comparing the 
probable return that can be achieved from a given amount of search 
effort by one means (assuming some knowledge of a pattern of operation 
or location) as opposed to a random search method (which requires 
no knowledge), the value of the alternate search procedure can be 
assessed. 

We now give several examples to illustrate these points. 

Random Search in an Area 

One of the classic military search problems is the location of a target, which 
may be in an unknown position (and therefore at random as far as the 
investigator knows) in a given area A, e.g., the problem of locating an 
enemy ship in the ocean. 

In this situation, an evaluation of the probability of the target’s detection 
is often useful if a random search procedure is used. 

To develop this point, consider an area A, as shown in Figure 10-6, 
which contains a target X located at an unknown xy location. It is proposed 
to tour the area A by means of a random path of length L. Let us further 
suppose that the target X will be detected by the search procedure if 
the search path passes within a distance W / 2 from the targets location 
(i.e., the search path sweeps out a band of width W symmetrically centered 
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1CI-6 Random search (n an area In the bounded area, there is an object of Search 
X. A random Search path of length L is made through A. It is assumed that the lost 
object will be found If the area of width W about l. covers X in the random search. The 
total area swept out by the search path is Wi. which is a fraction WUA of the total area 
under consideration To derive the equation shown in the text, the area swept out by 
ihe search is divided into a number of small boxes of area WUn, each having a 
probability WL/nA ot containing the lost object X. 


over the search point, and target detection definitely will occur if the band 
includes the target location, and will definitely not occur otherwise). Our 
purpose is to compute the probability of detection as a function of L and 
\V, in comparison to the search area A. 

We first divide the length of sweep L into n equal parts; in effect we 
divide the area \VL of the path swept into n small boxes of dimension 
(1VX L/n). The probability of detecting the target in one of those small 
boxes is given by the ratio of area (WL/n)/A - WL/nA , The probability 
of detecting the target after traversing all the small boxes along L is 


V « 



which follows from the probability that the target will not be detected in 
the sequence of n bores (considered to be independent, because of the 
random form of search and the random location of the target) being one 
minus the probability of detection at an individual box raised to the nth 
power. 14 


If the target is not detected after " tries. It must not be detected cm any of them. 
Thus, by virtue of the rouJlipbcation law of probability, the probabilities of failure are 
multiplied to gt the sequence probably o! fdlure, and the probability of detection 
it one minus this amount. 
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If we now let n become very large (i.e., let the area of the small boxes 
approach zero) we have the limiting condition 1 & 

p = 1 — g—ITLIA 

where e is 2.718. This result shows that as the length of the search path L 
increases, the probability of target detection approaches 1.0. This effect 
can be quickened by increasing the width of the detection band W. 

It is interesting to note that if the area WL swept by the search path is 
so small compared to A, that there is little chance of overlapping, the prob- 
ability of detection becomes simply p = WL/A, the ratio of the two areas. 
The higher-order terms, which in the limit produce the exponential, take 
into account the overlapping that may occur with longer search paths, which 
wilt in effect produce sampling with replacement. Either of the above 
formulas tells us that more effort at random search (larger L) produces a 
situation of diminishing returns. An added unit of effort in search does not 
progressively produce the same added increment in the detection prob- 
ability, but rather a progressively smaller one as the search continues. 

We may now ask bow much effort is worthwhile in a given search by 
this method? This question may be answered by evaluating the cost of a 
unit effort of search and the benefit to be had if the target is found. Thus, 
assuming that the benefit from target detection is $c 2 and that the cost of 
search is $c 2 /mile of L, we may write an equation for the total expected 
benefit from a search of length L as 

Total Expected Benefit in $ = c x (l — er' KUA ) — c z L. 

This equation is a function of L and may be maximized in a number of 
ways. For example, if we take W = 1 mile, A = 100 square miles, c, = 
$1000, and c 2 = $5/mile, successive substitution of trial values of L will 
reveal that the optimum length of search L is, for the assumed conditions, 
about 70 miles. 18 

Because in the random search method described above, the target may 
in fact have been “discovered" more than once, we may ask for the prob- 
ability that the target has been found exactly m times in a tour of length L. 

If we call the ratio WL/A = </>, the “coverage factor” of the search to the 

“ The series of terms generated by ( 1 - WL/nA)' equals e WL / x if n -* «, «, 

The same result may also be achieved by other means. 
j* Tabulated values of e* and e~* may be found m standard works, such as R S. Buring- 
ton. Handbook of Mathematical Tables and Formulas, 4th ed., McGraw-Hill, New 
York, 1964. 
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lola! area A, the probability of -Covering the target exactly m <* 

given by the Poisson distribution' 


P(m,4>) - 




The coverage factor * may be spec, Bed m vanous ways, **“*"6 
upon the problem, but for random search, the basic relatorrshtp holds. In 
general, 4> will represent the expected number of s.gbtings m3d ®' 

For example, suppose that m the area A there are numerous labels, no 
iust one. Then the target density is given by the number of targets N d 
by the area A, Nl A. Tire average number of targets sighted rn a t 
length L will be m = (WL)(N/A) = 4>. The probability of discovering 
exactly m targets in a random tour of length L is given by the 01 
distribution and the new value of * (where, of course, the assumptions at 
random scattering of the targets in area A, random search, ana certain e ' 
tection within W/2 of the path L are maintained as before). 


Example. Suppose a ship passes through a mine field in which there 
are N mines in a given area A. Suppose also that the mines have an 
effective range \V{Z for the ship (i,e., a mine will attach the ship if uie 
ship passes within W/2 of it) Then, <f> = WIN I A as before, the number 
of mines encountered in a path of length L is given by the Poisson dis- 
tribution, and the probability that the ship will encounter at least one 
mine m its tour L is ( L - e - *) . The “mean free path” or average distance 
between mine encounters is l/$ = A/WLN. 


Although more general treatment must be left to the references, it is 
worthwhile to consider one application of this result that gives a clue to the 
range of possibilities for further analysis. 


Example: Suppose a pack of S submarines may operate in one of two 
ways: either independently, or as a group. For simplicity, we assume 
the same detection and search rules as above (i.e., definite detection 
within a band W/2 of the random search path). The probability that 
a ship passing through area A will be detected at least once (by one or 


1 Derivations and extensions of this form of search theory may be found In P. Morse and 
G. Kimball, Methods of Operations Research, Technology Press and Wiley, New York, 
1051, and In four articles by D, O. Koopman, all in Operations Research (The Journal 
of the Operations Research Society of America): “The Optimum Distribution of 
Effort," I, 2, 1953, pp 52~63, “The Theory of Search: Part I, Kinematic Bases,” 
4, 3, 1950. pp. 324-340, “The Theory of Search: Part II, Target Detection,” 4, 5, 
1930, pp. 503-531; “The Theory of Search: Part III, The Optimum Distribution of 
Searching Effort” 5, 5, 1957, pp. 013-028. This form of analysis was widely used 
dunng World War II. 
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more of the submarines) is (1 - r**”*). If F ships per month pass 
through the area, the expected number sighted (at least once) is 18 

D — F(1 — er SWL/A ). 

Suppose that the submarines fire torpedos upon sighting, and that the 
probability of sinking the target after sighting is P. Then the expected 
number of ships sunk per month with S submarines operating indepen- 
dently in area A is 


H, = F(1 -c- 8 ™- 1 ). 

Although the value of D (expected number of the F ships sighted as a 
function of the effort expended ) saturates rapidly as the number of sub- 
marines S and the length of tour of each L is increased (because multiple 
sightings increase), when the probability of sinking upon sighting is 
small, the number of ships sunk H t does not saturate so quickly with 
increased effort, because the multiple sightings provide multiple tries 
at sinking the same target. 

When the submarines operate as a group instead of attacking sepa- 
rately, all will respond to a sighting made by one. Assume all the S 
submarines will be able to "home” on the first one and get their chance 
at sinking the ship. In this case the probability that the ship is sunk is 
1 — (1 — P) s instead of the value of P which it had if only one sub- 
marine attacked. Thus, the expected number of ships sunk by the group 
method is 


H, = F( 1 - e - i* - « - ). 

The relative advantage of group action over independent action is given 
by R = H a /H t . If this ratio is tabulated against the number of submarines 
S, for various ratios of WI</A, it may be seen that, if WLJA is large, the 
number of submarines in the optimum group moves toward 1.0, but if 
WL/A is very small (vast area to cover), continually increasing the size 
of the group is beneficial. 19 For example, data on Japanese shipping in- 
dicated that about three U.S. submarines per group would give optimum 
results in the Pacific. After the analysis and perfection of the group tactics 
by practice, the yield per submarine in a group of three was about 502 
greater than the yield per independent submarine, confirming the analysis. 

This number should not be confused with the expert ed number of “ship-sightings” 
made per month by the submarines which can exceed F and which is given by 
^ SFXVL/A- The number of ship sightings would follow the Poisson distribution 
with the given 4>- 

See plot. Figure 1, Morse and Kimball, op. cii , p. 89. 
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In conclusion, many of the calculations involving search in an area 
depend upon more complex detection and attach rules than the simple ones 
used for illustration here. For example, the use of alternate detection equip- 
ment (e.g., radar, sonar, the human eye, or other devices) can change the 
probability that a target will be sighted as a function of the distance from 
the point of search. A rule for visual detection often indicates that the 
probability of sighting falls off with the cube of the distance between the 
target and the detector More complex rules hold when the relative move- 
ment of the target and the detector are known. 

To illustrate the effect of the detector upon the search process, consider 
the following practical problem of exploration in which a two-stage search 
process is used. This problem has been simplified somewhat for presentation, 
but it contains the essential ingredients of the industrial exploration process 
and the important concept of using several levels of search detail. 


Example, 
consider a 


A valuable prize is located in one of hvo areas. ( As a prize, 
mineral deposit, an oil or gas deposit, or a similar valuable 


| An 


Area 1 


Area 2 


□ 


“ta C Urea It ’Tmfo t 1™™ °’ i " """ “ one and •»>>' one 
S' aVa ' W,1,! °" "» «** location of the prize 

available ( 1 ) an ' ° r ox ’ ,s r,ot ^ fi0 ' vn i- Two search methods are 
(2) an experms c StaTUSST f' “flTh 

mu l b Xos **■« - ond wh,ch 

expensive method^ss'ith a^nir^lom'choiM^or'the^rst^ltMlc^ * " Se ‘ he 

METHOD , (for certain location in one stage) 

Total Cost (in millions) = f 1)(05) + (2)(05) = $15, 

°” ‘ h0 &St 1-oV. .he process 

ktaSS he th"™ (ie " holes must be drilled 

durfssoltetf " LT “ 5M ' ah aad ““.uisition prone- 

method is nolcertlmXr; 'fZ, "“T— » <“ 
"hieh indicate the presence of the „ri, 'I?' 1 S ' VK s T u,ious readings, 
occasionally fails to detect the prize svhe i!‘ “ n °' * ere - and a1so 
chances are usually charactcS l 2o ^7' ^ 

Electronic or other detertirm *. . • 0t detect,on equipment used: 
terrain or in an j3T5WSLT? d «"* — -gh 
b mmuKnt atmosphere can issue false 
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signals due to vibration, random noise, and the like-as well as provide 
true signals for only a fraction of the prizes passed over. 

Nevertheless, suppose we decide on the fb flowing plan (which for 
illustration is simple to avoid mathematical complexity) : Two looks will 
be taken in each of the two areas, and any positive signals obtained by 
the inexpensive method (true or false) will be recorded. If one area 
scores higher than the other, the certain method of detection and acquisi- 
tion will be used first in the high-score area. Should the area scores be 
tied, we will revert to the random use of the expensive method exclu- 
sively. In either case, if the first expensive look does not yield the prize, 
the expensive method will be used on the remaining area, to acquire 
the prize with certainty. 

We now turn to a cost evaluation of the second method. Let us 
assume the following data for illustration. 

A = the probability that the inexpensive look will yield a false positive 

reading = 0.1 per look. 

p — the probability that the inexpensive look will yield a true positive 

reading when the prize is there = 0.2 per look. 

With these data available, we can now compute (1) the probability 
that the prize will be acquired on the first expensive look, (2) the prob- 
ability that the prize must be located by random application of the ex- 
pensive look, because the scores are tied, or (3) the probability that two 
expensive looks will be required, because the inexpensive look gave a 
false positive reading. 

To summarize these computations, suppose the prize is actually in 
Area 2 of the diagram below, so that the probability of getting a signal 
(false) from Area 1 is A per look, and the probability of getting a signal 
from Area 2 ( true or false ) is A 4- p - Xp = a per look. The actual scores 
generated by these two basic probability values, which we assume to be 
constant regardless of the number of looks, will have the binomial dis- 
tribution. Thus, for two inexpensive looks in each area, we may sum- 
marize the nine outcome score combinations possible in the table below, 
which also shows the correct probability values for each possibility. 


Area 1 


Area 2 


Probability of a signal = x 


Probability of a signal s: X + p — Xp=s« 


FREE IS HERE 


Note: In Area 2, a signal may arise either because the prize is there, with 
probability p, or from spurious causes with probability Because only 
one signal is obtained per look, both causes do not occur together. 
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Case Xj X, P(X,,X 2 ) 


Appropriate formula 


1 

2 

3 

4 

5 

6 

7 

8 
9 


0 0 0.41690 

0 1 032659 

0 2 0.06350 

1 0 009331 

1 l 0.07258 

1 2 0.01411 

2 0 0 00318 

2 1 0.00403 

2 2 000078 


(l-X)-(l-.)’ 

2(1 - M’«(l - «) 

(1 - MV 

2(M(l-M(l-«)’ 

2(M (1-M«’ 
V(l~«)' 


Total 099993 (or 1.0000 to four decimal places) 


In the above table, X, and X- are, respectively, the scores that could 
be obtained by recording the results of each look in Areas 1 and 2. The 
presence of a positive signal on a given look is indicated by “1” and a 
negative signal by "0." 

A somewhat more appropriate display, for our purposes, is given by 
the summary table below, which shows the probability of each score 
combination more compactly. 


X, 

X: 0 

1 

2 

0 

0.41990 

0.32659 

0.06350 

1 

009331 | 

0.07253 

0.01411 

o 

OXXI518 

0.00403 

0.00078 


Pa - P(X, = Xj) = 0.49326 


From ether table we may now compote the three required probabilities 

P. P(X, >X,)= 0.40-120 The sum of all the probabilities nboce 
the diagonal, for the cases in which the 
inexpensive looks have correctly located 
the prize. 

The sum of all the probabilities on the 
diagonal, for the cases in which the score 
“ tied and we must revert to random 
Pc = P(X, < X , ) = 0.10252 S 1Cah ° n , tbe ex P ens ive look. 

.. , Sum the probabilities below 
^ or "’hich the inexpensive 
has located the wrong area. 

search procedure, which consists of ^h oS £££ 
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using the inexpensive procedure, followed by application of the expen- 
sive detection and acquisition procedure according to the rules originally 
set forth. 

Briefly, this cost will consist of the cost of the inexpensive search, 
$10,000 for each of four looks, plus the cost of acquiring the prize on the 
first expensive look, or the cost associated with the random expensive 
look, or the cost associated with the selection of the wrong area. Thus, 
in millions, the cost expected from Method II is 

Method n (for certain location in two stages ) 

Total Expected Cost (in millions) = (1) (0.40420) + (1.5) (0.49326) 
+ (2) (0.10252) + (4) (0.01) = $1.3891 

Comparison of Method II with Method I indicates that the second 
method reduces costs by 

1.5000 - 1.3891 = 0.11087 or about $110,870 

so that here the second procedure is preferable. 

Several observations can be made from this simple problem. First, if 
the reliability of the inexpensive search or detection equipment could 
be improved for the given cost, then the cost reduction indicated above 
would be increased substantially. In the same way, it might be worth- 
while to pay more per inexpensive look to get a more reliable result at 
that stage, a trade-off which suggests further study. 

Next, with greater cost reductions in the total cost of detection and 
acquisition, it will pay to look for prizes that are progressively less val- 
uable. Thus, if the costs of detection and acquisition could be cut to $0,5 
million, then it would pay to look for prizes above that amount-whieh 
would not be sensible under the present circumstances. Further, should 
we be able to exploit anything found from a false expensive look— as a 
consolation prize, or prize of lesser value than the one we seek— then the 
economics of the twO-stage search change again. Because consolation 
prizes are clearly desirable, a systems approach to the search operation 
would call for some thought about what could be salvaged from a false 
positive decision. (A broader search objective may be desirable if for- 
tuitous or "serendipitous” findings can be exploited, such as gas and ore, 
as well as oil.) 

As an example, suppose a consolation prize worth $0.5 million would 
be obtained if the expensive look were made in the wrong area on the 
first try. (We still assume that there is a much more valuable prize in the 
remaining area but its exact worth is immaterial in the following com- 
putation.) Then, under this new condition, a comparison of Methods I 
and II would give 
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method i (certain detection of main prize in one stage with possible con- 
solation prize) 

(1)(0.50) + (15)(05) = §1.25 million 
method n (certain detection of main pnzc in two stages with possible 
conso’ation prize ) 

(1) (0.4W20 + (1.25) (0.49326) 

+ (15) (0.10252 + (4) (0.01) = §1.2146 million 
The net difference between Methods 1 and II has now dropped to just 
over $45,000. Thus, in this case, the presence of the consolation prize 
not only reduces the total cost of both methods, but also makes the 
random approach relatively more attractive than it was before. 

Finally, we can see (although a proof wiU not be given) that the two- 
stage-search process becomes more attractive when there arc few or no 
consolation prizes, and when there may be a large number of no-prize 
areas (more areas than the two used for illustration above). In this case, 
there would be a targe failure total for the expensive looks, which the 
hrst-stage search could eliminate or reduce. 

Various methods have been proposed and developed for exploiting 
For IX S '”( Ch pro “* lres >" prospect, ng and similar applications. 

T' ' e * der sWuW “"'Ult the literature on the 
fcms of tWs e f V P "'- ent P u n»*s, Practical search proh- 

Search For Extreme Values 

The use of random search procedures for the location of tm c.p, , 
or set of extreme values, „ of .mmediatc interest E5trem<! Va ' Ue - 

can be madelf'lV dntocUtemJ’irm' 1 ft! "wf*" ° f se, 3 uenMS t 5 ** 1 
■nercascs. the brute” “ se ™h Thro ' IX* ^teases •* N 

sequence of N items that may have the^te^™' 16 */” * partic,llaI 
impossibility. Yet, this scheduling & atKt ' or ,0west cost is an 

Survey.” p,„ c . Fw 

ta. BaW,. Md., 1957. 1 , rddota, £ {£££“ I'T"' 1 ' S “«y ot Am„. 
rf&f toto. P,«n, r ,*?d L J. D, CT . - Stel J,Uon 

e l"’{ 1S «. « dllrir -MrtkJ „r J J ‘“ n °l Colorado 

01 Mining Eiploralion over T T.-o . eui ?r ol Appraising Economic Prmtwir 

ocemenr Science. 3. 4. UU «£? \T" ri ^Afgerun Sahara cSTstSPTS! 
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Ves, and the results are similar to those shown above. Suppose that N 
items are to be scheduled, and that it were possible to list all the IV! possible 
schedules, together with a measure of their cost . 21 

Suppose each of these possible schedules can be arranged in rank 
order by cost, so that the lowest cost schedule will appear first in the list, 
the next highest cost schedule next, and so on. (Schedules with tied costs 
are listed adjacently without regard to the particular schedule.) From this 
rank-order distribution, it is now possible to determine the probability that 
an optimum, or near optimum, schedule will be found by random sampling, 
or search. Although, in general, this rank-order distribution will not be 
known, in many cases it may be approximated to obtain the same results. 

To illustrate, suppose that N! possible schedules of N items have been 
made, evaluated, and arranged in rank order by cost. We may want to 
find a schedule in the top 10% or top 2% of this list, let us say in the top 
fraction p. Because the total list is not available, we generate a set of 
schedules n at random and rank the sample set in order, taking the cheapest 
sample schedule as the “best” schedule available. This rule gives a geometric 
reduction in variety to combat the factorial. Thus, the probability that one 
randomly constructed schedule will be one of the top fraction p, is simply 
p (O^p^l). The probability that a sample of n will not contain one 
schedule of the top fraction ,, is (1 - p)"antl therefore the probability that 
it will contain at least one member of the top fraction p is 1 (1 P ) 

It is interesting to note that the result is only indirectly dependent upon 
the total size of N1 (the population) by the speoficatron of p. To illustrate, 
suppose that a sample of n = 100 is taken from a much larger popu hhon 
(2VI), so that the sampling may be considered random. Then, if we ask if J 
the probability that at least one of the n items will be in the top 1% (p 
0.01), using logarithms we End the answer 0.633a, as follows. 

(X _ p) = 0.99 = 9.9 X 10-’ 
log(0.99) = log(9.9) - 1 = 0.995&1 - 1 = -0.0W36 
100 log (0.99) = 100 X -0.00436 = -0.436 

To find the antilog of -0.436, we note that 

—0.436 = 0.564 — 1 


so that 


* X HH = 0.3665 


(0.99 ) ,0 ° = 10® * 

Thus, the probability of not picking one member of the top K is 0.3(^and 
the probability of picking at least one member of the top 1% is 1 0.36& 

0.6335. 

__„ tf Tv* assessed in many ways. However, if we restrict our 
“The cost of a ” ks tathe seooenee only (following a Maikov chain assomp- 

s^t^^devcoped flom a ff X * »a«* of pair costs, 

applied pair by pair. 



oviewicu 


l COirfJi^UEJ 


In the same way we could find that the probability of picking a member 
of the top 15 with a sample of n ~ 1000 is 0.99563, and this probability rises 
slowly with further increases in n. 

The reader may complain correctly that the sample of 1000 does very 
little for the probability of finding a unique optimum in a set of N\ equally 
likely possibilities. For example, if N — 100, reference to standard tables 
gives log, 0 1001 = 16Q, so that 100! = 10'*°. One percent of 10’ 80 is still a 
very large number, 10 !!S , which contains the unique optimum. In fact, the 
probability of finding the unique optimum with a sample of n = 100 is ap- 
proximately (100/1001)= 10-' a finite, yet infinitesimal probability of 
success. 


How is it then that this sampling method works in practice? It is 
important to note that if each of the 100! sequences (or items) had a distinct 
value, and therefore were equally likely, no search procedure that required 
examination o/ each and every case would he beneficial-or physically 
posable. However, two factors work m our favor. First, we seldom encounter 

hJofert!” m "*■?* r« Se °' d “ UnchM * « so great that we cannot 
a ranee of !7^ m<Merence (see Chapter 7) and be satisfied with 
anneeofposnb, teres that are suBcremly good as to be called optimum, 

l ch e amni?;t'ir e!P0 " *° the de,ecll0n concept in the area- 
te t C e r The ,an1e of« P ,r V , 10 ?. scct,on ' Tbe ' ar 8“ *««*!«> area, or 

Second the distnl ur r j C Dry ® nsw ers, the easier the search becomes, 
clustered so that somp art factions is seldom uniform, but is usually 

consSboSs^™XS": 01 l h ° 01 “ 

sampling to an aJconfallg 3 " d » *«'**»* 

and ttat the values oTthes S ' , '"" ces co ' ll<1 be generated at random 
histogram. Suppose also th^t Xs hkmtrr ° UW ^ fJ,° Ued in a man y- ste P 
curve with i = 1000 and v = 100 •nJft. am T?* be Etted by “ nonni >’ 
tell us that the probabtlity of fiodmv tal0 "’ led 8 e of tbi s pattern would 
equal to i - = ” End n 8 a sequence of cost less than or 

size to obtain at least one sen,, ' u"f W ° C0U d adlusl our sample 
because of some structural ,bat value. Moreover, if, 

how to generate sequences that fain °l lbe , cos,s involved, we know 
restrict our investigation m th"t 'f i '! T.,' , '“ Mst (767 >' 
even better results. ^ n the population, and achieve 


rr sTs . c T BEHADEoHAGBoup o-o~ s 

individually. It may^o^tmr! IhaUh^’"^ b ' f ° U ” d I " ust be isolated 
a< U,e oondition Of one item In the set 
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In the same way we could End that the probability o! pieldng a maj« 
of the top IS svith a sample ol n — 1000 is 0.99563, and this probability ns 
slowly with further increases in n. , -v 

The reader may complain correctly that the sample of 1000 does r 
little far the probability of finding a unique optimum in a set of A 7 1 equa y 
likely possibilities. For example, if N = 100, reference to standard ta 
gives logic 1001 = 160, so that 1001 = One percent of 10' 40 is stiU a 
very large number, 10™, which contains the unique optimum. In fact, t e 
probability of finding the unique optimum with a sample of n ~ 100 15 a P* 
proximately (100/1001)= lO - ***, a finite, yet mfinitesimal probability o 


success. 

How is it then that this sampling method works in practice? It is 
important to note that if each of the 1001 sequences (or items) had a distinct 
value, and therefore were equally likely, no search procedure that required 
examination of each and every case tcould be beneficial— or physically 
possible. However, two factors work in our favor. First, we seldom encounter 
a problem in which the range of distinctions is so great that we cannot 
invoke the concept of indifference (see Chapter 7) and be satisfied with 
a range of possibilities that are sufficiently good as to be called optimum, 
or successful This corresponds to the detection area concept in the area- 
search examples of the previous section. The larger the detection area, or 
the larger the range of satisfactory answers, the easier the search becomes. 
Second, the distribution of distinctions is seldom uniform, hut is usually 
clustered, so that some advantage may be taken of the knowledge of this 
constraint both in specifying a satisfactory answer, and in restricting random 
sampling to an area containing the optimum. 


Example: Suppose that N! sequences could be generated at random 
and that the values of these sequences could be plotted in a many-step 
histogram. Suppose also that this histogram could be Stted by a nomial 
curve with I = 1000 and . = 100. Then knowledge o! this pattern would 
tell us that the probability of Ending a sequence of cost less than or 
equal to * - 233* = 7ff? is about B, and we could adjust our sample 
sire to obtain at least one sequence below that value Moreover if 
because of some structural knowledge of the costs involved we know 
how to generate sequences that fall below this cost (767)' we could 
restrict our investigation to that portion of the population, and achieve 
even better results. 


WHEN TESTS CAN BE MADE ON A GROUP OF POSSIBILITIES 

Wc have previously assumed that an optimum to be found must be isolated 
individually. It may also be true that the condition of one item in the set 
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of possibilities to be examined will provide a group test. When this is 
possible, the binary splitting method may be employed in a series of yes-no 
tests which greatly reduce the number of tests required, or which in com- 
bination with some individual testing will isolate the required item. 

For example, as mentioned previously, the number of yes-no questions 
required to isolate one item from N is the next integer larger than log 2 N. 
Thus, if it were possible to use a binary test procedure to locate an item 
from a set of 100! possibilities, the number of tests required would be log 2 
1001 Because log I0 1001 - 160, and log* N = 3.322 log, 0 N, 160 X 3.322 = 
532 (rounding off to the nearest integer) tests would be the minimum re- 
quired to guarantee isolation by the binary procedure if it could be applied. 
This result again indicates the power of the binary questioning method when 
it can be employed. 

Let us take a case, however, which does not easily permit complete 
binary testing, but which, nevertheless, permits a group test to be made. 

Example: Suppose that blood samples have been taken from N persons, 
and that each of these samples has been numbered, for later identifica- 
tion. Then, suppose a test (which will have a yes-no, or positive-negative 
result) is made by pooling a fraction of each blood sample in an over- 
all group test. This may be done, for example, by taking half the blood 
in each subject's test tube and pooling the blood so obtained. The re- 
maining blood samples will not be pooled, but will be left in the original 
distinctly identified test tubes. If we are looking for patients who have 
a certain disease. For which the test has been designed, and if the pooled 
test will turn out negative if and only if all of the individual samples 
pooled are negative (one positive sample will cause the pooled test to 
be positive ), we have the following results. 

Let p = the probability that a patient has a positive individual test 
n = the number of patients pooled 

Then, the probability that the pooled test will fail (requiring individual 
tests of the n patients set aside for future reference) is again I — (1 — 
p) n . If there are N individuals to be tested, and if c, is the cost of a group 
test and c t is the cost of an individual test, then the expected total cost 
to test N patients by this procedure is 

Total Expected Cost /JV\ /N\ , , 

(group of N patients) “ c ‘ (n ) + \n ) 0 ' " p) 1 

where the first term on the right-hand side is the cost of group testing, 
and the second term is the cost of testing n individuals in the expected 
number of groups that fail the group test. This equation may be mini- 
mized for a given value of p by variation of n. It will be seen that, if p 
is small, the optimum value of n will be large, and vice versa. To illus- 
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trate. suppose p = 0.1, e, - e> 51°. anli N 

various dioices of n ~ 1> 2* 3, 4. and 5. 


: 100. We now tabulate 


Group cost 
$1000 
500 
333 
250 
200 


Individual cost 

( 10)(2)(5G)(0.19) = 
(10>(3){33>(0.2?) = 
( 10){4)(2S)(034) 3 
(10 1(5} (20) (0.41) = 


Total cost per 100 
$1000 
690 
603 
590 
610 


The lowest cost is $590 per 100, or $590 per patient, and represents a 
reduction m the cost per patient to leas to 6<B of the indmdual patient 
cost « This {otto ©f testing is suitable when we can exploit the constraint 
that all the rediuduals must be -negative" for the group to be “negative, 
and when such a group test can be made without examining the in- 
dividuals. 


An oft-repeated logical problem presides another example of group 
testing in which the number of tests may be reduced to one, by the exploita- 
tion of constraints. 


Example: A sheik receives eight tnba! representatives, each of whom 
is to pay taxes for his tribe Payment is made m gold bars, weighing 
exactly one pound each, and each taxpayer places his payment in a 
separate pile before the sheik so that each tribe is identified by position. 
Bdorc the gold bars are gathered together {or the treasury, the sheik Is 
informed that one (and only one) of the tribes has cheated by shaving 
exactly one ounce of gold from each bar tendered in payment. However, 
before he can reveal the tnbe, the informer is murdered by an unknown 
assailant With this information available, however, the sheik can easily 
determine the guilt) tribe For this purpose, he has a scale calibrated in 
pounds and ounces, and in one neigh ing he is able to isolate the culprit. 
How is this done 3 

The solution to this problem exploits both the arrangement of the 
piles, which is one constraint, and the knowledge of the weights of the 
good and the defective bars which provide enough additional constraints 
to produce a unique solution. Assuming that each pile contains eight or 
more ban. the sheik places on the scale one bar from the first pile, two 
bars from the second, three bars from the third, and so on for all eight 
piles. The ounces of shortage indicate the pile containing the shaved 
bars, thus solving the problem in one test. This result is considerably 
better than the binary split procedure, because the system of gold piles 
has been treated as a single eight-state problem instead of a series of 


" frebohlU, 7 1„ eni U, Ap- 
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two-state problems, and because the facts of the case permit direct iso- 
lation of the one defective state in eight. 

To see the effect of the stated constraints, it is worthwhile to consider 
the procedures the sheik might follow if ( 1 ) it were known that only 
one of the gold bars in one of the piles had been shaved one ounce, or 
(2) it were known that some of the gold bars in one of the piles had 
been shaved one ounce each, or (3) it were known that there were one 
or more bars shaved by one ounce somewhere in the collection of bars. 
In (1) and (2), a minimum of three weighings using the binary split 
procedure would be required to isolate the defective pile. In (3) the 
number of weighings would depend upon the way in which the defective 
bars were distributed in the piles. (How many weighings would be 
necessary if there were exactly one defective bar in each of the eight 
piles— and this fact were not known beforehand?) 

Although the examples of this section have been chosen for their dra- 
matic value, group testing often occurs in practice before the investigator 
sets to work. For example, in a system of N components or blocks, each of 
which must work for the system to operate, knowledge of system malfunc- 
tion is essentially the result of a group test that failed. Similarly, when a 
system is subjected to a response test, that test is a group test, which for 
the tested input-output relation requires all the blocks in a given chain to 
be operative. For this reason, a preferred first-response test for a complex 
system is one that will require the correct operation of all the system blocks, 
because if this test passes, no further testing may be needed. 

When a group test fails, we may then employ a series of block or in- 
dividual tests as indicated by the failure symptoms or by the logical require- 
ments for isolation, as previously described. 


SEARCH WITH PARTIAL INFORMATION 

We now present two selected search problems that illustrate how partial 
information may be exploited to improve the search process. 

The Search for a Unimodal Optimum 

If we know that an optimum set of conditions for a system exists within a 
certain range, and that the relationship between a given adjustment and 
the optimum is “unimodal," i.e., that the adjustment will cause the system 
to rise steadily to its optimum condition, and that further adjustment will 
cause a steady decline from that optimum, we have extremely valuable 
information to aid in the search process.* 3 

» The normal curve, for example, is unimodal, with the mode at the average. The normal 
curve falls away steadily on both sides of the average, and has no lower peaks. 
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For example, suppose a systems output/mput ratio (yield) can be 
varied by a change of a smglc parameter ( say, temperature ) x. The relatton^ 
ship between x and the yield f(x) is not Wvn completely, bu^suppose 
it is known that optimum adjustment of x lies within the range 0 — * ’ 

and that j{x) can be evaluated within the specified range ^of x at selected 
poinb. Finally, suppose that the yield f(x) is “unimodal” i.e. there is a 
single optimum x„ such that x 0 is within or at the boundary of the interval 
0 to L, and that /(x) is strictly increasing for x^Xo and strictly decreasing 
for x > x. (or else strictly increasing for x < x„ and strictly decreasing for 
X^X„). 

Under these conditions, the following steps offer an optimum experi- 
mental search procedure, which we wall first describe graphically, and then 


improve by more detailed specification. 24 

Suppose /(x) looks like Figure 10-7 a within the interval 0 ^ x L- 
The curve shown is "ununodal" as required [which means there is only one 
hump or peak in the curve for f(x)]. Because the value of f(x) must, by 
this assumption, steadily decrease on either side of the optimum x 0 , two 
initial measurements, x 2 and x 2 > x u can produce one of the following 
three outcomes 


f(x ,) > f(xi) The optimum must he to the left of x 2 , and the region of 
search can be restricted to the interval 0 to x 2 . 


/( *, ) < /( x* ) : The optimum must lie to the right of x, and the search can 
be restricted to the interval Xi to L. 


1 (* 1 ) ~ fix;): The optimum is m the interval x, to x 2 . However, to be 
consistent with what follows, we may restrict further search either to the 
interval 0 to x t or r, to L. 


The net result of the first two experimental measurements is to reduce 
the interval 0 to L, to one of the smaller intervals indicated above. Within 
that reduced interval, there will be one measurement that can be used in 
die next step. For example, in Figure 10-7a, the test shows the optimum 
is in the interval 0 to x z . One added measurement in that interval, say 
O' — z > allow the test to be repeated again, and so on thereafter, 

until the optimum x 0 can be located within a narrow interval, which may be 
made as small as desired by continued experimental measurements. 

The above procedure will work if the location of the experimental mea- 
surements is chosen arbitrarily. However, the following rules will provide 
an optimum selection of these points. 

Suppose that it is desired to locate the optimum x, nothin a narrosv band. 
Then sse can divide the interval 0 to L into a number of small segments 


wllbitit proof, for vhich «re R. E . B*» and 
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10-7 Procedure for optimal unimodal search in one dimension. 

whose width equals that band or interval of isolation. Thus, if we make F 
divisions, the isolation interval width is L/F, and we can, if we chose, 
measure the length of L in those units, i.e., five equal intervals of length 
LI 5, etc. When this decision has been made, we can consult the following 
table to determine how many experimental measurements n will be needed, 
to provide the required isolation. 

F: 1 1 2 3 5 8 13 21 34 55 .. . 

n: 0123456789... 
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In general, the relationship between n and V is 

F(n) = F{ft - 1) + F{n - 2), F(0> = W “ °- 

The latter requirement indicates that one experimental measurement does 
no more good than none m locating the optimum. (We will continue and 
explain the table momentarily However, it may be seen that any value ot 
p in the table is equal to the sum of the previous two values of f \ } 

Let us consider first the case of isolation into one of two intervals, 
{p _ 2, n » 2). The problem is first to select the position of the two mea- 
surements required, x, and x. The rule for optimum selection of these two 
values is” 

x, — F»-s and Xj — F,. t 


where for n ~ 2, the appropriate values of F are obtained from the table. 
In this case, the values for x> and x : from the table are both 1; le., both 
measurements should be made at the midpoint of the interval. (The method 
attempts to divide the interval into equal parts in this case. ) However, two 
measurements at the same point will not reveal anything about the location 
of the optimum in the first or second partt Therefore, suppose the trial value 
for x, is oSset by a very small amount «, so that x> — 1 ~ t and S 3 — 1. With 
this slight modification, the two test measurements will indicate that the 
optimum is either in the interval (0 to 1) or in the interval (1 — c to 2), 
where again we has e measured x in the number of divisions of the interval 
0 to L. Because < can be made arbitrarily small, the problem is resolved by 
this device. (It is important to understand this case, because if F is greater 
than 2, the final test always resolves itself into the one described above.) 

To improve the precision of" the isolation, we may now ash for the 
interval 0 to b to be divided mto F = 5 parts. From the table, n — 4, For 
the first two trial values x, and r. we hav e 


- Tv-. - 2 and r* — F,_ 


= 3 , 


where the interval count is based on divisions equal to % of L. Figure 10-7 
(e) illustrates the first two selected measurements, from which we may 
conclude that the optimum (for the case shown) lies to the left of x 3 and 
vs in one of the three segments numbered 1, 2, and 3. 

. ''' "T, hlVe 3 <1 ' tK 'r art t>rol*n>. Again, apply the [ onnu ] a f or the 
'ocation ot the n«t meamrementt. For (he three-pan problem, measurements 
should bo at locations 1 and 2. the latter already nistio. e ,g., x_ vriH be at 
location I. Now we see the optimum is above x,. 


the opuaalw 

ttSSttSZittZL w to ■ E - *- * — ■ •*» * 
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We now again have a two-part problem, with an existing experimental 
measurement at the midpoint between intervals 2 and 3. The last measure* 
ment x, could then be made at that point, but offset by a small amount t, 
as before. Application of the test as usual will now isolate the optimum in 
one or the other interval, interval 2 in this case, within the error 

This procedure can be used for isolation to any degree of precision by 
increasing F, and always works for one-dimensional optimization. Although 
the series of values for F, which is a function of n, increases slowly at first, 
it increases rapidly for larger values of n; e.g., F(n = 20) is greater than 
10,000. Thus, the position of a maximum can always be located within 10' 4 
of the original interval length (0 to L) in at most 20 observations x t to x 2 o, 
selected as described above. For large values of n, F(n) is given accurately 
by 1.45(1.62)" for this procedure, 26 which may be compared with the 
equilavent binary procedure for isolation when group tests can be made, 
i.e., F(n) = 2". Thus, although the method described is not quite as good 
as the binary split method, it provides geometrical cuts in variety in many 
cases where the binary split approach is not practical. 

The problem of determining optimal search procedures for locating the 
absolute maximum of a function of N variables is very difficult, and (al- 
though we will sketch out one extension) further discussion is beyond the 
scope of this Chapter. 27 

Hill Climbing Methods 

The experimental determination of “optimum” system response usually 
involves a series of experimental perturbations, or changes, in system condi- 
tions in small steps. The steps are generated and examined sequentially. The 
changes that produce the greatest increase in response guide the investigator 
to a new set of conditions, and so on, until the optimum is reached. This 
method is known as "the technique of steepest ascent." 

Example: Suppose the output z of a chemical process depends upon 
two variables: Temperature x and pressure y. The system response z = 
f(x, y) is a three-dimensional surface, or a two-dimensional contour 
map, as shown in Figure 10-8a- 28 The contours shown in this hypothetical 

” See Bellman and Dreyfus, op. cit., p. 155. Hie procedure described above can easily 
be extended to cover the location of an optimum when x can take on only integer 
values, as shown by Bellman and Dreyfus on page 156. _ _ 

11 Bellman and Dreyfus, op. cit., provide selected references to this point, pp. 178—179. 

For a survey of experimental procedures, see O. L. Davies (Ed.), Design and Analysis 
of Indtistrial Experiments, Hafner, New York, 1954, Chapter 11. 

28 Compare Figure l0-8a and 10-8b with Figure 7-8. The contour map also may be used 
to show three variables, when drawn in three dimensions. A number of interesting 
experimental surfaces are illustrated in this way in D- 1*. Davies, op. cit.. Chapter II. 

For a popular expository article with a bibliography of similar Search methods, see 
J. M. Idekohn, “Ten Ways to Find the Optimum,'’ Control Engineering, June, 1964. 
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10 -* Contour maps of a response surface Illustrating the steepest response, or hM 
climbing, method el search lot optimum conditions, (a) has e single optimum. Con- 
tours are shown os a percent of the true optimum (which is unknown in General) Search 
fe confined to the “region ot interest,” selected in advance from previous knowledge- 
With the single optimum, the method ot steepest ascent leads segue filially to the opti- 
mum. With the multiple opttma ot (to), the terminal point depends upon the starting 
point, and several starting points must be tried to Isolate the global optimum. 


surface connect points of equal yield on the response surface, and are 
marked in percentages of the optimum { which is within the 90% contour 
and not shown). Although the surface is shown for illustration* its shape 
and the location of the optimum are not known, and experimental 
observations are expensive, so that extensive systematic measurements in 
the region of interest f limited to the rectangle bounded by ar„ and y 9 ) are 
not feasible. (If observations were inexpensive, %ve could systematically 
make a grid ol observations or take a large number of random observa- 
tions in the region of interest and develop Figure 10-Sa in this way.) 

To start the steepest ascent method, assume that an experiment consist- 
ing of four observations about the pwA P has been conducted. If the mot 
of measurement is small compared to the slope of the response surface, the 
direction of maximum ascent can be estimated from the experimental points, 
suggesting an area around point Q for further investigation. The experiment 
is then repeated about point Q, and so on up the hill. In general, as such 
a sequence of experiments continues and the investigator comes near the 
top of the hill, the surface will tend to Batten out, or hump over and call 
for more careful determination ot the path of steepest ascent. (Typically 
linear estimates of the path are made initially. Upon coming near the 
optimum— a 'near stationary point’-higher-order estimates, such as para- 
bolic path estimates, are used. The computational details of this procedure 
are given by Davies in the references cited. ) 
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The hill climbing or steepest ascent method works for continuous re- 
sponse surface with a single peak. Should there be several peaks, the 
terminal point will depend upon the starting point of the search, so’ that 
a series of starting points is often used when multiple peaks are suspected, 
Figure 10-8(b), and the maximum of the maxima found by this method is 
taken as the global optimum. 19 


DYNAMIC PROGRAMING AS A SEARCH APPROACH 

To conclude our selected search examples, we turn now to the procedure of 
dynamic programing, 30 which also seeks an optimum in stages. 

There are many systems optimization problems that may be treated 
either as one search effort involving many combinations or, alternatively, 
as a series of search problems, each involving a smaller number of combina- 
tions. In many cases, the latter choice is beneficial in reducing the computa- 
tion and evaluation involved in finding the optimizing conditions. 

Example: Suppose a lost object is located in one of three areas, labeled 
1, 2, and 3. The probability that the item will be found in a given area 
is given in the table below. Suppose that the cost of making an observa- 
tion varies from area to area; this cost is also shown below. In addition, 
suppose that the probability of finding the object (if it is in a given area ) 
is 0.1 per observation; i.e., detection is not certain, but will improve as 
the number of “glimpses” increases. If a total of $30 is available for 
search, how should the glimpses, or observations, be allocated to the 
three regions to maximize the probability of finding the lost object? 


Area = i 



I 

2 

3 

Probability p,- that object is in area i 

0.5 

0.3 

0.2 

Cost per glimpse w. 

$10 

$ 5 

$ 3 

Glimpses allocated n. 

ni 

n 2 

n 3 

Resources allocated n, to, 

TliWi 

TXiWz 

n 3 to 3 


“The philosophy of taking a route to the optimum that follows steps of ever-improving 
response (increases in the obj'ective function of Chapter 8) is adopted in a number of 
search methods. For example, it is used in linear programing. Figure 7-4, which, with 
linear constraints and a linear objective function, leads to the optimum if it exists. 
Similarly, searches through decision trees may proceed in this way, although several 
alternative starting points or initial search directions are usually required. For an 
illustration, see Egon Balas, “An Additive Algorithm for Solving Linear Programs 
with Zero-One Variables," Operations Research , 13 , 4, July-August, 1958, pp. 517-545. 

“See R. E. Bellman and S. E. Dreyfus, Applied Linear Programing. Princeton Univer- 
sity Press, Princeton, N.J., 1962, especially Chapters I and II. 
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H vre consider area 1. the detection probability is given by 
ability that the item is in a given region p. times the Ft***, 
the tt«m will he detected B it is in ™S™ .. and ", ghropse have hee^ 

made there. The latter probability is given by (1—0.9 

the probability that n, glimpses svdl fad to delect the object, tf 

present. Thus, the probability of detection in region i with n, glimpses 

P,{n,) * (p.) <1-0.9*'). 

(There are three such equations, with i = 1, 2, 3, representing the situa- 
lion to each tegion.) 

The pTOblem may now be formally stated as 

Maximize 52 Pi ( n i) {{=1,2,3) 


Subject to 52 i»it0» ~W (i ~ 1, 2, 3) 

where W is $30, the total resource available. These two equations simply 
say that we should obtain the highest detection probability possible for 
the three areas as a whole, hut that we cannot spend more than the 
total resources available, $30, Because the lost object must be in one 
and only one of the three areas, they may be considered independent, 
and the probabilities add, as shown, presenting a typical resource alloca- 
tion problem, which arises frequently in systems analysis and design. 

Before continuing, it is worthwhile to find the maximum number of 
glimpses that could bo bought in each region if the total resource were 
spent there. Division of the total available by the cost per glimpse for 
each region { \V/u:< ) gives this result as a maximum of three glimpses 
in Area 1, six glimpses in Area 2, and ten glimpses in Area 3. 

One way to proceed with the solution would be to enumerate all 
feasible combinations of the ni values, within the $30 limit, evaluate the 
total detection probability, and select the combination of n< that gave 
the largest detection probability for $30. Some thought will reveal, how- 
ever, that even for this simple three-area problem such a tabulation 
would be extensive, and that for a many-area, large-total-resource 
problem, the table could get out of hand entirely. 

We therefore think of attacking the problem in stages, first consider- 
ing Area l, then Area 2, and finally Area 3. To do this we will consider 
out specific problem as a special case of a more general class of alloca- 
tion problems, i.e., imbed" our problem in the larger class, by assuming 
that the total resource available is not fixed, but may vary from 0 to \V. 
When this is done, the detection probability in a given area will be a 
function of the resource available (because whatever the resource maybe 
it limits the glimpses that can be made). With this idea in mind, we 
create a tabulation in which we first consider Area 1 only and compute 
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the probability of detection for various resource levels from 0 to W. 
(In Table 10.1, only steps of $10 have been shown for simplicity.) 

Table 10.1 Stage 1 


w 

Possible tii 

/.=?.= (0.5) (1 

0 

0 

0.000 

10 

1 

0.050 

20 

2 

0.095 

30 

3 

0.135 


We stop tabulating W at $3Q, because this is the total resource available. 
For future reference, we call the one-stage probability of detection 
for each value of W. The value f t is the maximum detection probability 
that can be bought with the W available at each increment in Area 1. 

Turning to Stage 2 of our computations, we consider the effects of 
allocating various levels of the total resource W to the two areas (Area 
1 and Area 2) in an optimal way. For clarity in this computation, we 
show in complete detail all figures in Table 10.2, but we note that, in 
fact, only the lines with an entry in column (6) are of any interest. 
We allocate a given level of W to the two areas, considering Area 2 
first. The number of glimpses allocated to Area 2 is progressively in- 
creased. Any left-over resources are allocated to Area 1, and we pick 
up the largest or maximum detection probability available to us from 
that allocation to Area 1, namely, fi(W) from Table 10.1. Finally, in 
column (6) Table 10.2, we show for Area 1 and Area 2 the maximum 
detection probability possible for the specified level of W. This process 
continues until W = $30, when the tabulation ceases. 

In Table 10.2, column (1) shows the level of W allowed as we step 
to higher values. These steps are now in $5 increments, because the cost 
per glimpse is $5 in Area 2. Then, in column (2) we show all possible 
values of n 2 , from 0 to the maximum possible. In column (3) we show 
the resources left over (W — n 2 u> 2 ) . Column (4) shows the detection 
probability in Area 2 for the value of n 2 specified on the same line. 
Column (5) shows the detection probability in Area 2, which is the 
maximum possible for the resource left over (W — n 2 u? 2 ). Finally, column 
(6) shows the sum of columns (4) + (5), which is the maximum for any 
level of the total resource W. In short, the entries in column (6) repre- 
sent the maximum detection probability available if the total resource 
\V is allocated to Areas 1 and 2. For future use, we may discard all 
lines in Stage 2 except those with an entry in column (6), and produce 
a “Condensed Stage 2 M table, which more clearly illustrates this fact 
(see Table 10.3). The notation f t indicates the maximum two-stage 
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because the working system provides evidence that the “new” modules 
are all in working order at the time of substitution. 

The reader will realize that the Hock substitution procedures just 
described are essentially different ways of implementing the signal-tracing 
and signal-substitution methods already discussed. For example, the sub- 
stitution of a “good" system from the input to a given point applies a 
correct input to that point, and is equivalent to synthetic signal substitution. 
Conversely, substitution of a “good” system from a given point to the 
system output is equivalent to probe insertion at that point, as required 
for signal tracing. The approach chosen is thus a matter of physical 
convenience. 

In this section we have considered a number of fault-search methods 
widely used in hardware systems for trouble-shooting. These methods, 
however, arc of general use, and, as an exercise, the reader should attempt 
to cite examples of the general procedures in specific fields. Under what 
conditions, for example, can the signal tracing procedure be successful? 
When is block substitution preferred to signal tracing? What additional 
information might improve the trouble-shooting sequence? How can these 
data be organized? How could these trouble-shooting procedures be 
automated? Consideration of such questions will considerably improve 
the reader’s appreciation of the approaches cited. 

It is also worthwhile to note that even rudimentary training in the 
suggested procedures often leads to great improvement in the trouble- 
shooting ability of maintenance and “special project" personnel. 


INFORMATION SEARCH PROCEDURE 

Information retrieval is another example of the search problem in sys- 
tems analysis In this section, we present a brief survey of the procedures 
used, and note that the forms of search indicated may be expressed also 
in the Boolean Algebra format of Chapter II. 


The Rem-Characteristic File 

information files are compos o[ a fe, o( !lem , ( suc!l as hoofa attjc l« 
tad parcel!, ant! individual names) that have certain characteristics 
(such as topics covered, key words, attributes, and sltrtlv).- 

For example. a file eomisting ol eight documents described by six 
toy words {which either appear or do not appear) represents a simpli- 

t2SSSi«ww:t 
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Bed file. For simplicity, the documents will be numbered (coded) 1~8, 
and the key words will be coded A-F. (The codes used may be structured 
to have meaning, as indicated in Appendix C and the reference below.) 
The associations of characteristics with items, i.e., key words with docu- 
ments in this file, may be displayed as in Figure 10-3, which is a term' 
document or characteristic-item matrix for the hypothetical eight docu- 
ments and six key words. A “1” in a given row indicates that that document 
can be described by the indicated key word. Similarly, a “1” in a column 
indicates that that term describes the indicated document. 


Item Retrieval 

Documents having the desired key words (or items having the desired 
characteristics) may be retrieved or extracted from the file in various 
ways. 

If we can operate on the rows or columns of the term-document 
matrix C of Figure 10-3, which may be possible if the matrix is not 
too large, we can find the documents directly by two procedures, which 
we will illustrate by example. 

Example: Suppose that documents containing both key words A and 
B are desired. Then if the elements of columns A and B of Matrix C 
are multiplied together cell by cell using logical multiplication (1*1 = 1, 
all other combinations equal 0), the required documents will be 
indicated by the Is that remain in the calculated result. Thus, 


Documents J 

A 

B 

c 

D 

E 

Key words 

F 

1 

1 

0 

0 

1 

0 

0 

2 

1 

1 

0 

I 

0 

0 

3 

0 

0 

1 

1 

1 

1 

4 

0 

1 

1 

0 

0 

1 = Term-document matrix C 

5 

1 

0 

0 

1 

1 

0 

6 

0 

0 

I 

0 

1 

0 

7 

0 

1 

0 

1 

0 

1 

8 

1 

1 

1 

0 

0 

0 


10-3 A typical term-document matrix C. In this format, documents are coded 
numerically and key words alphabetically. Each row refers to a document and its as- 
sociated key words. Each column refers to a key word and its associated documents. 
(It Is equally possible to work with the transpose of this matrix by the Interchange of 
row and column operations described In the text.) 
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Documents 

A 

B 

(A-B) 

1 

1 

0 

0 

2 

I 

1 

1 

3 

0 

0 

0 

4 

0 

l 

0 

S 

1 

0 

0 

6 

0 

0 

0 

7 

0 

1 

0 

6 

1 

1 

I 


and documents 2 arid 8 are the ones desired. {In Chapter 11, this 
form of manipulation will be expanded to include other forms of 
logical combination of characteristics, which will then be called logical 
statements having the value True or False. ) The procedure provides 
a direct matching of key words with documents. 

Example: Another simple procedure that provides a count of the 
number of matches between the desired key words and those present 
for each document uses ordinary matrix multiplication by a column 
vector composed of 0’s and Vs to indicate the desired key words. For 
example, if documents having key words B, C , and F were desired 
we could form the column vector for key words, which has l's in 
positions 8, C, and F and 0's elsewhere as shown below. Multiplication of 
the term- document matrix, C, by q. in that order, produces the column 
vector r, which describes the number of matches for each document. 


1C} X 


Key word 


B 

C 

D 

B 

F 


•O' 

Document 

=* 1 


1 

2 

i 

1 

3 

2 

0 

4 

3 

0 

5 

0 

1 

6 

I 


7 

2 


8 

2^ 


1" ttw raampte. d ' lcum ™> 4 matches the three key words sped- 
l, °'' neT - documents match one 
or two of the specified key .cords, and may therefore be of some 
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interest. In efficient retrieval systems in which the keywords are 
precisely applied, documents which have the highest value in the 
computed column vector r will be the most relevant. (In practice more 
complicated computation forms are often used to give a relevance 
score for each document in the set, as discussed briefly hereafter and 
extensively by Salton. 8 ) 

Three File Search Methods 

Three forms of search are often used m extensive files, namely, (1) 
search by item for given characteristics, (2) search by characteristic 
for items so described, and (3) search by short tables of characteristic 
with cross references. Each of these methods which we will illustrate 
by using the term-document matrix data of Figure 10-3, is suitable for 
computer use. 

Example: A file may be organized by listing items as a heading, and 
sublisting the characteristics. This produces an item-characteristic file. 

A search on the item, or document in this case, can be made down the 
document list. If the desired key words are present for a given document, 
that document number is extracted. Such a listing of documents need not 
be in any particular order, but the search for all relevant documents 
must encompass the entire list. Physically, such a file organization and 
search procedure corresponds to the search for items by examination of 
every item on a computer tape, or, in the manual file, to the search for 
a sub-set of desired items from a set of edge-punched (McBee) cards. 
(See Figure I0-4a. ) 

Example: Conversely, a file may be organized by listing characteristics 
as a heading and sublisting items having that characteristic. Again, 
with our previous definition, this produces a characteristic-item file. In 
this file, the characteristics (key words) must appear in a given order. 
Here we assume an alphabetical order of key-word codes for efficiency. 

To find the required documents, we first place the specified key words 
in alphabetical order. Then, we consult the first specified key word and 
note the relevant documents. Next, we consult the second key word 
and retain the relevant documents for this key word (if any) that also 
match the relevant documents for the first key word. This procedure 
continues until all the specified key words have been consulted, and the 
common relevant documents are isolated. This procedure has the advan- 
tage that only the specified key-word categories need be consulted, not 
the entire list. However, a strict ordering of the key words In the file is 
required so that they can be found easily. Physically, this method corre- 
sponds to the manual use of "peekaboo” cards, illustrated in Figure KMb. 

' See G. Salton, “Progress in Automatic Information Retrieval," IEEE Spectrum, August, 

1965, pp. 9(1-103. This review article contains an extensive bibliography. 
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ItM Edge-punched Kern cards and peek-a-boo characteristic cards. In (a), one card 
is made tor each «em 'document lo ft We. and the characteristic (key word) is punched 
jt specified edge positions Items 1 end 2 of Matrix C, Figure 15-3, are shown. To 
extract all Hems with a given characteristic, a needle is Inserted in the edge position 
(hole or notch) for that characteristic (8 »s shown). All Items with that characteristic 
tall out of the Pie in (b), one card *s made for each characteristic, and items having 
that characteristic are indicated by punching out a hole In a designated Item position. 
Characteristics A and B of Figure 10-3 are shown. To find items with given characteris- 
tics. the desired Characteristic cards are aligned Light wUt pass through the item 
holes that have ail the specified characteristics, here documents 2 and S. 


Example A third form of file organization and search {known as the 
Tabled ex Method) »s a tabular combination of the previous procedures, 
which drastically reduces search time at the expense of file complexity. 
The file u organized by mealing a table for each characteristic, or key- 
word in the present case, and listing in order as subheadings the items 
{documents ) described by the given key word. So far the organization is 
similar to the previous example. However, we now expand the table by 
listing for each document shown all of the pertinent characteristics 
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greater in order than the characteristic, or key-word code, of the table 
at hand. To accomplish this we use the alphabetical ordering of key 
words and numerical ordering of documents shown in Figure 10-3 for 
the term-document matrix C. Figure 10-5 shows the complex Tabledex 
file organization for the data of Figure 10-3. Note that the characteristic 
tables are arranged in alphabetical order, that the documents in each 
table are in strict numerical order (where they appear), and that the 
characteristics for each document so listed in a table are in alphabetical 
Order and contain entries of higher alphabetical order than the table 
designation (again, where they appear). 

To search for documents described by a given set of key words (or 
items described by a given set of characteristics) we first put the desired 
characteristics in ascending (alphabetical) order. We then scan the 
Tabledex file for the table corresponding to the lowest order of the 
desired characteristic. We next scan the document list in this table for 
documents (if any) showing the remaining key words. If we find such 
documents, they are the desired ones, and our search ends. We need 
only consult one table in this method of search. 

For example, suppose we wanted the documents described by key 
words B and C. Because these key words are in order, we consult the 


Table 

A 

Table 8 


Table C 

Documents 

Key words 

Documents 

Key words 

Documents 

Key words 

1 

D 

2 

D 

3 


D,E,F 

2 

B,D 

4 

C,F 

4 


F 

5 

D,E 

7 

D,F 

6 


E 

B 

B,C 

8 

C 

8 


(None) 

Table 

D 

Table E 


Table F 

Documents 

Key words 

Documents 

Key words 

Documents 

Key words 

1 

(None) 

3 

F 

3 


(None) 

2 

(None) 

5 

(None) 

4 


(None) 

3 

E,F 

6 

(None) 

7 


(None) 

5 

E 






7 

F 







10-5 A tabledex file organization for the term-document data of Figure 10-3. One 
table Is prepared for each Key word (coded alphabetically here). The tables are 
ordered alphabetically (by the code) as shown from left to right down the page above 
To find a document associated with a given set of key words, the keywords are first 
ordered by their code. Then the tabfe for the first keyword Is found. Documents de- 
scribed by the other key words specified (if any) will appear in that table and may be 
tound by inspection without recourse to other Tables In the file. 
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TaUed«Fifeand find Table B.The docu meets described by B we listed. 
We cert scan the keyword listing for each docmnent shown «“taiiltag 
C The only documents satisfying the request ate documents 4 and 8. 

A manual Tabiedei 6!e may be constructed as a book of pages. with 
one page for each key-ssord table, and the remaining detad printed on 
tlie page. {A computer cap be programed to produce such a book if 
desiied-i Then, to locate a given document, we turn to the page repre- 
senting the lowest order key word, and 2nd the request (if it exists) on 
that page, by following the above method. 


Searching with Retaxed Conditions 

With any of the above procedures, the search f Qr a given item (or docu- 
ment ) may possibly not lead to any items having the specified characteris- 
tics [ orVey words) iS the characteristics -required wt too msmfctocfc. to swe-h. 
a case, the number of requirements must be reduced to find some items 
that are reasonably desirable. 

Tbs relaxation of conditions raises questions about which of the charac- 
teristics to discard and which to retain in a further search. First , a combina- 
torial problem arises. For example, if, say, five key words are used in the 
search for a document and the search fails, we might think of using only 
four key words, and then repeating the search. However, the number of 
ways four key words can be selected from the original five is five, the number 
of ways three keywords can be selected from five is ten, the number of ways 
two key words can be selected from five is ten, and so on. ,<> We may have to 
perform a senes of searches of this type before finding any relevant docu- 
ments The situation worsens severely as tbe original number of desired 
key words increases. 

Fortunately, several methods combat this combinatorial difficulty. For 
example, the search procedure described on page 274 will give the number 
of matches that do occur for the specified key words, and therefore serves 
to rank tbe documents in the order of relevance (by the number of matches) 
even though no perfect matching document can be found. 

Example Suppose, using the data of the term-document matrix fC] of 
Figure 10-3, we desired to find documents described by key- words A, B, 
C, and D \\ ithout knowledge of u-hat is in the file, we may thus form 
the column request vector q, and multiplication of tbe term-document 
matm C by the request vector q <m that order) will produce a column 
vector r which, m egect, evaluates the documents in the file in terms of 
the request. Thus, for the four key words suggested: 

- JWaB thrt, in Renrral, the number of * a> , , can U selected from n h 

«!/{<rf) (« — *)!]. 
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[C] X (?) = [r] 

Key word Document 



The results show that no document is described by all four specified 
key words (because there are not four matches), but that two docu- 
ments show three matches (documents 2 and 8), and several show two 
matches. One order of preference would be to inspect the documents 
in the order indicated by the number of matches. 11 

The concept just illustrated is easily extended to the case in which some of 
the key words are more important in the search than others. In such a 
case, the original request vector q may show weighted values for each de- 
sired key word (instead of the binary indication of the previous example), 
and the process can be repeated. For example, suppose we still want key 
words A, B, C, and D, but that the key word C is thought twice as important 
as the other key words. Then we have 


Key word 

[C] X A 
B 
C 
D 
E 
F 


Weighted 

Request 



Document 

1 

2 

3 

4 

5 

6 

7 

8 


Weighted 

Matches 



11 Ledley, op. cit., pp. 506-510, indicates an alternate sorting procedure to achieve the 
same result. 
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which now prefers document 4. Although such weights may be arbitrary, 
depending upon the investigators choice it is also possible to generate 
weights based on the possible correlation of key words and documents in 
the file, and such methods are often used in automated retrieval informa- 
tion systems.” 

Finally, there may be forms of search in winch certain hey words (or 
characteristics) must be satisfied, and other key words, although desirable, 
are not necessary. In such cases, the search can proceed by first partitioning 
the file (and therefore the corresponding term-document matrix) by sorting 
out the documents with the mandatory key words, and then continuing as 
before. 

Example: Suppose that for the file described by matrix C of Figure 10-3 
we wish to find documents described by key words A, B, and C, but that 
term A is mandatory. Then, by a sort on key word A (and deletion of 
key words D, E, and F, which are now irrelevant to the search), we 
have a reduced term-document matrix C SE w, which may now be searched 
for key words B and C as before. 


Document 


Key word 
B C 


i*wl X hfntw-l c 

Key word 


l r n«wl 


Document Matches 


l 


x b m - i 

V 

2 

> ‘ 

C Llj 2 

2 

5 

I 0 

5 

> 

8 

.1 1. 

8 

.2. 


me new term-document matrix contains only rows 1, 2, 5, and 8, because 
these are the only documents u the Ble having the mandatory key word A; 
it contains only columns B and C, because these are the only pertinent 
key words remaining (or the search. The result shows that documents 
2 and 8 meet all three requirements, because we know the doeu- 
menu have key word A (by virtue of the original sort) and because two 
matches indicate that both key words B and C have been matched >• 


Indexing 

T?." 1 ,ha ' " CmS ” documra,s fiW 

associated with them defame charactenst.es which arc dearly rccogniaed 

“See Saiton, op nr , pp 9-t_Q5 

U li useful to note that the ffutrii hv vertnc t„ . .. , 

is equivalent to making a count for each USed m ^ examples above 

r. that appear i„ in *u««Mon. of the number of 

^ctshted «m.« „ w not, W?vT«or y TequCSt * or lhe of a 
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and which can be attributed to a given item when the Ble is made up. For 
example, suppose a file shows product assemblies as items, and the attributes 
of these assemblies are their specific parts. Then the relationship of part to 
assembly is clearly defined by product specification, or blueprint, and no 
ambiguity should occur in the assignment of “characteristics” to items. A 
similar situation obtains in highly structured information systems, such as 
the chart of accounts: The characteristics that should cause a given cost to 
be posted to a given account are defined, so that the association defined by 
the item characteristic table is clear. 

More generally, however, the job of assigning hey words to documents, 
or characteristics to items, which is called indexing, is not so clear-cut. For 
example, suppose we have a technical article that refers to ( 1 ) aircraft, and 
(2) manufacturing, and that these key words are assigned to that document. 
The article in question will not be found if a request is made for all articles 
described by the key words (1) airplane and (2) production. This problem 
is distinctly different from the cases described above for the assembly file 
and the cost accounting file, where the specification of a few rigidly pre- 
scribed terms will locate a unique item, or set of items. In the more general 
information retrieval problem, a given item or document can be described 
in an infinite variety of ways and shades of meaning which places an un- 
realistic burden on the indexing, processing, and request operations. For the 
rigidly indexed file, the terms used by the indexer and the requester must 
be complete and absolutely consistent to provide the desired result. 

A number of devices are used to alleviate this somewhat unrealistic 
requirement. Some intermediate device or step is required to overcome 
inconsistencies and incompleteness which often arise as the popularity of 
descriptive terms changes, as personnel changes, and as the scope of the 
master file increases. 


Dictionaries 

Acceptable term dictionaries are often provided, so that the indexer and 
the requester can use similar words in the input and output operations. Fre- 
quently, this approach is expanded to include a thesaurus of terms to which 
a common code may be applied. For example, aircraft, airplane, and other 
synonyms for the same type of flight equipment may be coded to a single 
identifier, say subject 3.16, which is thereafter used as the indexing and 
search code for Sling and retrieval. Similarly, concepts and operations may 
also be given a code number, and a dictionary or thesaurus of concepts 
arranged in a hierarchy of generality may be developed, so that a given 
specification can be related to more or less general specifications, as the 
need arises. Again, the same idea may be used to develop phrase dictio- 
naries, in which, say, the terms "cost” and “accounting” are combined into the 
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mote useful descriptive term 'cast accounting” to be coded and used for 

'^Sucb^devices are usually employed in automated information retrieval 
systems, so that minor variations in indexing and request formats can be 
handled. Another approach, also used in automated information systems, is 
to compote the statistical correlation between terms used in the file, and 
between items that have similar descriptions. This can be accomplished for 
a given file, and revised by tecomputation ns the contents of the file 
changes. The computation may also be arranged in stages to suggest alter- 
nate terms for search, or to provide an automatic weighting for the terms 
used in a request, and for those terms generated automatically from the 
request. 

For example, in a given file we may find that the term “aircraft” appears 
frequently with, and therefore is highly correlated with, such terms as 
“airframe" “fusilage,” “propeller," “acceleration" “jet," and “supersonic” 
However, the terms associated with “aircraft* may also be associated in the 
file with the term "airplane,'* which may be thought of as a second-order 
term. Thus, the analysis of the contiguous modifying terras for one more 
general term, may in turn lead to one or more synonyms for it as those 
synonyms arc employed in the file itself. By this procedure, the term “air- 
plane" could automatically be added to the initial request for documents 
pertinent to “aircraft." Thus, the original request for an item may be ex- 
panded or contracted by the information processing system, and weights 
may be computed for evaluation of document pertinence along the lines 
previously described 

As we shall sec m Chapter 1L, the information search procedures we 
liave discussed here can be expanded to many applications, including the 
search for causes of system difficulty, when we are given a set of observed 
symptoms. 


SOME RANDOM SEARCH PROBLEMS 

bet us now torn to some search problems, which, although they involve 
isolating one or more specific items (rom a much larger, and perhaps 
infinite, population, accomplish this not by logical sequencing hut by 
random sampling. 7 

V"' ,at8ct ' “ "V'"" 1 ™ poms from many possible 

I , T,’ "JlrV PtMm ,n » “random” search prLdL 

" J" cf “ l though tt would not seem to at first, a randmn search 

“ wm anTITtl ' ' 

1<cm » not associated with the choice of the 
specific procedure used, the item wiH not be found. h 
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Example: Suppose a particular item belongs to a set of odd-numbered 
categories. If the investigator selects an examination or testing proce- 
dure which considers only even-numbered items, that procedure is 
doomed to failure. It will not find the desired item, and this failure is 
a property of the chosen search method. 

Similarly, a random search procedure is, on the average, one which can- 
not be defeated by opponent strategy. 

Example: Suppose the police search for crimes in progress by following 
a systematic surveillance route: The radio car makes a systematic tour 
of the community at specified hours. If this search pattern becomes 
known to criminals in the community, they will surely take advantage 
of the systematic pattern and commit their crimes when it calls for 
nonsurveillance. The same argument holds for both space and time. 
And so, many police forces make random tours of surveillance. 

In addition, the use of a random search procedure often gives the 
investigator a limiting condition for item isolation, which can be used as a 
standard against which other search methods can be compared. 

Example: Any knowledge of the probability distribution of an item’s 
location improves the probability of locating it. But it may be desirable 
to determine how much this information is worth. By comparing the 
probable return that can be achieved from a given amount of search 
effort by one means (assuming some knowledge of a pattern of operation 
or location) as opposed to a random search method (which requires 
no knowledge), the value of the alternate search procedure can be 
assessed. 

We now give several examples to illustrate these points. 


Random Search In an Area 

One of the classic military search problems is the location of a target, which 
may be in an unknown position (and therefore at random as far as the 
investigator knows) in a given area A, e.g., the problem of locating an 
enemy ship in the ocean. 

In this situation, an evaluation of the probability of the targets detection 
is often useful if a random search procedure is used. 

To develop this point, consider an area A, as shown in Figure 10-6, 
which contains a target X located at an unknown xy location. It is proposed 
to tour the area A by means of a random path of length L. Let us further 
suppose that the target X will be detected by the search procedure if 
the search path passes within a distance W/2 from the target’s location 
(i.e., the search path sweeps out a band of width W symmetrically centered 
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10-6 Random search In an area. In the bounded area, there is an object of Search 
X A random search path ol length L is made through A, It Is assumed that the lost 
object w'M be found it the area qI width W about L covers X in the random search. The 
total area swept out by the search path is WL, which is a traction V/U A of the total area 
under consideration. To derive the equation shown in the text, the area swept out by 
the search is divided into a number ot small boxes ol area WUn, each having a 
probability W UnA ol containing the lost object X. 


over the search point, and target detection definitely will occur if the band 
includes the target location, and will definitely not occur otherwise). Our 
purpose is to compute the probability of detection as a function of L and 
W, ui comparison to the search area A. 

We first divide the length of sweep L into rt equal parts, in effect we 
divide the area WL of the path swept into n small boxes of dimension 
(IV X L/n) The probability of detecting the target in one of those small 
boxes is given by the ratio of area (\VL/n)/A — WLfnA. The probability 
of detecting the target after traversing all the small boxes along L is 



which follows from the probability that the target will not be detected in 
the sequence of n boxes (considered to be independent, because of the 
random form of search and the random location of the target) being one 
minus the probability of detection at an individual box raised to the nth 
power,* 4 


" n , trlc *' « must not be detected on any of them. 

m “ 3u ? !,cat *■» hr «* probability, the probabilities {/failure are 

rilt “S LS” 5 ”" d «S P'obsbAly J 
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If we now let n become very Urge (i.e., Jet the area of the small boxes 
approach zero) we have the limiting condition 15 


V « 1 - c~ vrL,A 

where e is 2.718. This resuJt shows that as the length of the search path L 
increases, the probability of target detection approaches 1.0. This effect 
can be guickened by increasing the width of the detection band W. 

It is interesting to note that if the area \VL swept by the search path is 
so small compared to A, that there is little chance of overlapping, the prob- 
ability of detection becomes simply p = WL/A, the ratio of the two areas. 
The higher-order terms, which in the limit produce the exponential, take 
into account the overlapping that may occur with longer search paths, which 
will in effect produce sampling with replacement. Either of the above 
formulas tells us that more effort at random search (larger L) produces a 
situation of diminishing returns. An added unit of effort in search does not 
progressively produce the same added increment in the detection prob- 
ability, but rather a progressively smaller one as the search continues. 

We may now ask how much effort is worthwhile in a given search by 
this method? This question may be answered by evaluating the cost of a 
unit effort of search and the benefit to be had if the target is found. Thus, 
assuming that the benefit from target detection is $c, and that the cost of 
search is $c 2 /mile of L, we may write an equation for the total expected 
benefit from a search of length L as 


Total Expected Benefit in $ = c, ( I — e ' n ' c/A ) — c 2 L. 


This equation is a function of L and may be maximized in a number of 
ways. For example, if we take IF = 1 mile, A ~ 100 square miles, e, *= 
$1000, and c z ~ $5/mile, successive substitution of trial values of L will 
reveal that the optimum length of search L is, for the assumed conditions, 
about 70 miles. 18 

Because in the random search method described above, the target may 
in fact have been “discovered” more than once, we may ask for the prob- 
ability that the target has been found exactly m times in a tour of length l. 
If we call the ratio WL/A = 4>, the “coverage factor” of the search to the 


u The series of terms generated by ( 1 - WL/nA )‘ equals e' r ^ A if n — oe, i e , 


.H)'- 


The same result may also be achieved by other rawns. , , . ~ <• 

“ Tabulated values of e* and c" may be found in standard wwU »* ■* R [ 
ton. Handbook of Mathematical Tables and Formulas, 4th ed., McCnm Hill, JSew 
York, 1964. 
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total area A, the probability of d, severing the target exactly m times is 
given by the Poisson distribution'- 

The coverage factor * may be specified in various ways, depending 
upon the problem, but for random search, the basic relationship holds. In 
genera!, <p will represent the expected number of sightings made. 

For example, suppose that in the area A there are numerous target, not 
just one. Then the target density is given by the number of targets N divided 
by the area A, N/A. The average number of targets sighted in a tour of 
length L will be m ~ (WL)(N/A) = <f>- The probability of “discovering 
exactly m targets in a random tour of length L is given by the Poisson 
distribution and the new value of (where, of course, the assumptions of 
random scattering of the targets in area A, random search, and certain de- 
tection within W/2 of the path L are maintained as before). 

Example. Suppose a ship passes through a mine field in which there 
are N mines in a given area A. Suppose also that the mines have an 
effective range W/2 for the ship (i.e., a mine will attack the ship if the 
ship passes within W/2 of it). Then, <£ - WLN/A as before, the number 
of mines encountered in a path of length L is given by the Poisson dis- 
tribution, and the probability that the ship will encounter at least one 
mine m its tour L is { 1 — r*). The “mean free path" or average distance 
between mine encounters is l/£ - A/ XV LN. 

Although more general treatment must be left to the references, it Is 
worthwhile to consider one application of this result that gives a clue to the 
range of possibilities for further analysts. 

£xample Suppose a pack of S submarines may operate in one of two 
ways either independently, or as a group. For simplicity, we assume 
the same detection and search rules as above {i.e,, definite detection 
within a band W/2 of the random search path). The probability that 
a ship passing through area A will be detected at least once (by one or 


” gXmSl* l °T ° f may be Wd ta P. Morse and 

?asTX, \ teKarch. Technology Press and Wiley, New York. 

^ iwi 0 * 1 * » b r ® ° Koo^nao, at] in Operation, Research (The JouraS 

4 fniM Saaichi Part I, Kinermt.c Bases,” 

K- sT IS "■ °^r» d ““ £u “ rf 

during \\ oiVl Wat U W W.MIZB. This form of analysis was widely used 
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more of the submarines) is (1 - If F ships per month pass 

through the area, the expected number sighted (at least once) is” 

D = F(1 - c-*™). 

Suppose that the submarines fire lorpedos upon sighting, and that the 
probability of sinUn g the target after sighting is P. Then the expected 
number of ships sunk per month with S submarines operating indepen- 
dently in area A is 


H t = F(1 


Although the value of D (expected number of the F ships sighted as a 
function of the effort expended) saturates rapidly as the number of sub- 
marines S and the length of tour of each L is increased (because multiple 
sightings increase), when the probability of sinking upon sighting is 
small, the number of ships sunk II, does not saturate so quickly with 
increased effort, because the multiple sightings provide multiple tries 
at sinking the same target. 

When the submarines operate as a group instead of attacking sepa- 
rately, all will respond to a sighting made by one. Assume all the S 
submarines will be able to “home" on the first one and get their chance 
at sinking the ship. In this case the probability' that the ship is sunk is 
1 — (1 — P) s instead of the value of P which it had if only one sub- 
marine attacked. Thus, the expected number of ships sunk by the group 
method is 


H t =z F(1 — e~ <SWL/A> ti-u- w ). 

The relative advantage of group action over independent action is given 
by R = Hg/Hi. If this ratio is tabulated against the number of submarines 
S, for various ratios of WL/A, it may be seen that, if WL/A is large, the 
number of submarines in the optimum group moves toward 1.0, but if 
WL/A is very small (vast area to cover), continually increasing the si 2 e 
of the group is beneficial. 19 For example, data on Japanese shipping in- 
dicated that about three U.S. submarines per group would give optimum 
results in the Pacific. After the analysis and perfection of the group tactics 
by practice, the yield per submarine in a group of three was about 50% 
greater than the yield per independent submarine, confirming the analysis. 

“This cumber should not be confused with the expected number of “ship-sightings" 
made per month by the submarines which can exceed F and which is given by 
£ j_ SFWL/A. The number of ship sightings Would follow the Poisson distribution 
with the given <t> ■ 

“ See plot. Figure 1, Morse and Kimball, op. cti., p. 89. 
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In conclusion, many of the calculations involving search uv an area 
depend upon more complex detection and attach rules than the simple ones 
used for illustration here. For example, the use of alternate detection equip- 
ment fe.g., radar, sonar, the human eye, or other devices) can change the 
probability that a target will be sighted as a function of the distance from 
the point of search, A rule for visual detection often indicates that the 
probability of sighting falls off with the cube of the distance between the 
target and the defector. More complex rules hold when the relative move- 
ment of the target and the detector are known. 

To illustrate the effect of the detector upon the search process, consider 
the following practical problem of exploration in which a two-stage search 
process is used. This problem has been simplified somewhat for presentation, 
but it contains the essential ingredients of the industrial exploration process 
and the important concept of using several levels of search detail. 

Example; A valuable pnze is located in one of two areas. (As a prize, 
consider a mineral deposit, an oil or gas deposit, or a similar valuable 


Ares 1 


Area 2 


asset. ) Although u is known that there is one and only one prize in the 
total area, no information »s available on the exact location of the prize 
fte., the correct area, or box, is not "known). Two search methods are 
available 1 1) an inexpensive method, which costs 510,000 per look, and 
(2) an expensive method, which costs $1 million per look, and which 
must be used (e.g, drilling for oil) for prize acquisition. 

An obvious way of locating the prize with certainty is to use the 
expensive method, with a random choice for the first look. 


metjjop i ( for certain location in one stage ) 

Total Cost (in millions) “ / 1 )(0.5> -f (2)(05) = 


$15, 


brauje i[ the priK « not (out*) on the first expensive loot the process 
must be repeated m (he remaining area (t.e., two holes most lie drilled 
to locate and acquire the prize) 

He nmv ash whether or not a to o-stage -search and aajoitiitan proce- 
dure x'ouM he less enpcrmxc. llmvcvcr, the taexperaive detection 
methml «s not certain. Moreover, the method gives spurious readings, 
vvhid, twhrate the presence of the prim when „ „ Mt there, onJ 
occasional)) tails to detect the pm.- when it is (here. For example these 

£zzro“:*\ cUmth °< *■ <i *«™ 

uSm o, in S TT T vm ™ “"W in » «™v over tough 
"" " ” ,n “ OTfl ■Wh turbulent atmosphere can issue take 
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signals due to vibration, random noise, and the Iike-as well as provide 
true signals for only a fraction of the prizes passed over. 

Nevertheless, suppose we decide on the following plan (which for 
illustration is simple to avoid mathematical complexity): Two looks will 
be taken in each of the two areas, and any positive signals obtained by 
the inexpensive method (true or false) wall be recorded. If one area 
scores higher than the other, the certain method of detection and acquisi- 
tion will be used first in the high-score area. Should the area scores be 
tied, we will revert to the random use of the expensive method exclu- 
sively. In either case, if the first expensive look does not yield the prize, 
the expensive method will be used on the remaining area, to acquire 
the prize with certainty. 

We now turn to a cost evaluation of the second method. Let us 
assume the following data for illustration. 

A — the probability that the inexpensive look will yield a false positive 
reading = 0.1 per look. 

p — the probability that the inexpensive look will yield a true positive 
reading when the prize is there = 0-2 per look. 

With these data available, we can now' compute (1) the probability 
that the prize will be acquired on the first expensive look, (2) the prob- 
ability that the prize must be located by random application of the ex- 
pensive look, because the scores are tied, or (3) the probability that two 
expensive looks will be required, because the inexpensive look gave a 
false positive reading. 

To summarize these computations, suppose the prize is actually in 
Area 2 of the diagram below, so that the probability of getting a signal 
(false) from Area 1 is A per look, and the probability of getting a signal 
from Area 2 (true or false) is A + p - Ap = a per look. The actual scores 
generated by these two basic probability values, which we assume to be 
constant regardless of the number of looks, will have the binomial dis- 
tribution. Thus, for two inexpensive looks in each area, we may sum- 
marize the nine outcome score combinations possible in the table below, 
which also shows the correct probability values for each possibility. 


Probability of a signal ~ x 


Probability of a signal ~\ + p — 


PRIZE IS HERE 


Note: In Area 2, a signal may arise either because the prize is there, with 
probability p, or from spurious causes with probability X. Because only- 
one signal is obtained per look, both causes du not occur together. 
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P(X U X 2 ) 


Appropriate formula 


0.41990 

0.32639 

0.06330 

0.09331 

0.07258 

0.01411 

0.00518 

0.00403 

0.00078 


<i~xr(i~«r 

2(1 •“* X)*a(l — a) 

a -xjv 

2{*)(1 ~X)(1 — a)' 
4(X)(l~X)c (1 -«*) 

X'(l - a)' 

2X’n(l - a) 


mu. 099993 (or 1.0000 i D four decimal places) 

In the above table X. nnA v 

be obtained by recording the rKote'of'SSTo’ 11 ’? S “ res ,fat «»H 
presence of a positive signal on a civ!^! \ — '1 A,HU 1 and " "Hre 
negative signal by "0 * ^ S 0t “ mi!, na<ed by "1” and a 

*> *« by 

combination more compactly. probability of each score 



= P(X t = X*) s 


0.49326 


2r £££ tt F ” lib ' lito *» 

2^ S £5 *■*■**■«-•.* 

««ed a„7* “ S ? fa ' Vh,chfcs ““ 

ttraSSsttus 

probabilities abm .. -,i *” ' ron § area. 

-=aa.*i5£S=SjSas 

n ot lhe two boxes 
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MCTTOD I {certain detection of main prize to me stage with possible con- 
solation prize) 

(i)(050) + {15><0.5) = $1.25 million 
ittrmoD n (certain detection of main prize in two stages with possible 
consolation prize) 

(I) (0.40420 + (U5) (0.49326) 

+ (15) (0.10252 + (4) (0.01) = 5U146miUion 

Ihe net dsSerence between Methods I and H has now dropped to just 
over $43,030. Thus, in this case, the presence of the consolation prize 
not only reduces the total cost of both methods, but also males the 
random approach relatively more attractive than it was before. 

Finally, we can see (although a proof will not he given) that the two* 
stage-search process becomes more attractive when there are few or no 
consolation prizes, and when there may be a large number of no-prize 
areas (more areas than the two used for illustration above). In this case, 
there would be a large failure total For the expensive locdcs, which the 
first-stage search could eliminate or reduce. 

Various methods have been proposed and developed for exploiting 
multistage search procedures m prospecting and similar applications. 
For further reference, the reader should consult the literature on the 
subject.*’ Unfortunately for our present purposes, practical search prob- 
lems of this type often require special study and the use of advanced 
analytic and computational methods. The example just given, however, 
illustrates the basic principles involved, and is sufficient as an introduc- 
tion to this interesting field. 

Search For extreme Values 

The use of random search procedures fox the location of an extreme value, 
or set of extreme values, is of immediate interest. 

for example, as mentioned previously, the number of sequences that 
can be made of distinct items is VI. Because Ml rapidly increases as N 
increases, the brute-force search through MM possibilities for a particular 
sequence of X items that may have the greatest value or lowest cost is an 
impossibility Yet, this scheduling os sequencing problem is faced industri- 
ally every day Can a random search procedure help here? 

* 1 " CJuamns in Mmwalosiasl and Other Surveys.' - Froc First ln~ 

temetumal Contes* on Operotkwx RcmotcM, Operation* Rev-arch Society of Amer- 
pi' 195 V“ T 1 C Ca$ths I C*™. “Simulation 

196 ?’ M *?***’ of Appmsioz Economic Prospects 

o! Mifl-OR Exploration wo tiwse Temtcntv-Mgman Sahara Case Study" Mon- 

li; 



Some Random Search Problems 293 


Yes, and the results are similar to those shown above. Suppose that N 
items are to be scheduled, and that it were possible to list all the Nt possible 
schedules, together with a measure of their cost. 21 

Suppose each of these possible schedules can be arranged in rank 
order by cost, so that the lowest cost schedule will appear first in the list, 
the next highest cost schedule next, and so on. ( Schedules with tied costs 
are listed adjacently without regard to the particular schedule. ) From this 
rank-order distribution, it is now possible to determine the probability that 
an optimum, or near optimum, schedule will be found by random sampling, 
or search. Although, in general, this rank-order distribution will not be 
known, in many cases it may be approximated to obtain the same results. 

To illustrate, suppose that N 1 possible schedules of N items have been 
made, evaluated, and arranged in rank order by cost. We may want to 
find a schedule in the top 10% or top 2 % of this list, let us say in the top 
fraction p. Because the total list is not available, we generate a set of 
schedules n at random and rank the sample set in order, taking the cheapest 
sample schedule as the “best” schedule available. This rule gives a geometric 
reduction in variety to combat the factorial. Thus, the probability that one 
randomly constructed schedule will be one of the top fraction p, is simply 
p (0^p-=l). The probability that a sample of n will not contain one 
schedule of the top fraction p is (1 — p) n and therefore the probability that 
it will contain at least one member of the top fraction p is 1 — { 1 — p) n - 
It is interesting to note that the result is only indirectly dependent upon 
the total size of Nl (the population) by the specification of p. To illustrate, 
suppose that a sample of n = 100 is taken from a much larger population 
m, so that the sampling may be considered random- Then, if we ask for 
the probability that at least one of the n items will he in the top 1 % ( p = 
0.01), using logarithms we find the answer 0.6335, as follows. 

(I - p) - 0.99- 9.9 X 10 l 
log (0.99) == log(9.9) - 1 = 0.99564 ~ 1 = -0.00136 
100 log (0.99) = 100 X -0.00136 = -0.436 
To find the antilog of — 0.436 , we note that 

-0.436 = 0.564 - I 

so that 

(0.99) iM = 10* X Kb' « 0.3665 

Thus, the probability of not picking one member of the top 1% is 0.366 5, and 
the probability of picking at least one member of the top 1% is 1 - 0.3665 = 
0.6335. 

*' The cost of a sequence may be assessed in many ways. However, if we restrict our 
attention to the cost of pairs in the sequence only (following a Markov chain assump- 
tion) the cost of a sequence may be developed from a N X & matrix of pair costs, 
applied pair by pair. 
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In the tame way we could find that the probability ■ of 1 plcMng a ntembet 
of the top U with a sample of n = 1000 Is 0.99363, and thtf probability nses 
slowly with further increases in n. 

The reader may complain correctly that the sample of 1000 does very 
hide for the probability of fining a unique optimum in a set of Ni equally 
ldcely possibilities. For example, if N — 100, reference to standard tables 
gives logio 1001 * 160, so that 1001 = 10 ,M . One percent of 10 m is still a 
very large number, 10 m , which contains the unique optimum. In fact, the 
probability of finding the unique optimum with a sample of n = 100 is ap- 
proximately (100/100!)= 1(H 5 \ a finite, yet infinitesimal probability of 


success. 

How is it then that this sampling method works in practice? It is 
important to note that if each of the 100! sequences (or items) had a distinct 
value, and therefore were equally likely, no search procedure ihat required 
examination of each and every cate would be beaeficial~Or physically 
possible. However, two factors work in our favor. First, we seldom encounter 
a problem us which the range of distinctions is so great that we cannot 
invoke the concept of indifierence (see Chapter 7) and be satisfied with 
a range of possibilities that are sufficiently good as to be called optimum, 
or successful. This corresponds to the detection area concept in the area- 
search examples of the previous section. The larger the detection area, or 
the larger the range of satisfactory answers, the easier the search becomes. 
Second, the distribution of distinctions is seldom uniform, but is usually 
clustered, so that some advantage may be taken of the knowledge of this 
constraint both in specifying a satisfactory answer, and in restricting random 
sampling to an area containing the optimum. 


Example ■ Suppose that N\ sequences could be generated at random 
and that the values of these sequences could be plotted in a many-step 
histogram Suppose also that this histogram could be fitted by a normal 
mine with s = 1000 and o- = 100. Then knowledge of this pattern would 
tell us that the probability of finding a sequence of cost less than or 
equal to i — 2.33* = <67 is about IS, and we could adjust our sample 
size to obtain at least one sequence below that value. Moreover, if, 
became of some structural knowledge of the costs involved, we know 
how to generate sequences that fall below this cost (767), we could 
restrict our investigation to that portion of the population, and achieve 
even better results 


WHEH TESTS CAN BE MADE ON A GROUP OF POSSIBILITIES 

Tf <ha ' 3n < ’ p,im " m "» •* *»■»* « be isolated 
individually. I, may .to be true that the contain* of one item In the set 
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of possibilities to be examined will provide a group test. When this is 
possible, the binary splitting method may be employed in a series of yes-no 
tests which greatly reduce the number of tests required, or which in com- 
bination with some individual testing will isolate the required item. 

For example, as mentioned previously, the number of yes-no questions 
required to isolate one item from N is the next integer larger than Iog 2 N. 
Thus, if it were possible to use a binary test procedure to locate an item 
from a set of 1001 possibilities, the number of tests required would be log 2 
1001 Because log l0 1001 = 160, and log 2 N = 3.322 log 10 N, 160 X 3.322 = 
532 (rounding oS to the nearest integer) tests would be the minimum re- 
quired to guarantee isolation by the binary procedure if it could be applied. 
This result again indicates the power of the binary questioning method when 
it can be employed. 

Let us take a case, however, which does not easily permit complete 
binary testing, but which, nevertheless, permits a group test to be made. 

Example: Suppose that blood samples have been taken from IV persons, 
and that each of these samples has been numbered, for later identifica- 
tion. Then, suppose a test (which will have a yes-no, or positive-negative 
result) is made by pooling a fraction of each blood sample in an over- 
all group test. This may be done, for example, by taking half the blood 
in each subject's test tube and pooling the blood so obtained. The re- 
maining blood samples wjJJ not be pooled, but will be left in the original 
distinctly identified test tubes. If we are looking for patients who have 
a certain disease, for which the test has been designed, and if the pooled 
test will tum out negative if and only if all of the individual samples 
pooled are negative (one positive sample will cause the pooled test to 
be positive), we have the following results. 

Let p~ the probability that a patient has a positive individual test 
n = the number of patients pooled 

Then, the probability that the pooled test will fail (requiring individual 
tests of the n patients set aside for future reference) is again 1 -(1 - 
p)» if there are N individuals to be tested, and if c, is the cost of a group 
test* and c 3 is the cost of an individual test, then the expected total cost 
to test N patients by this procedure is 

Total Expected Cost _ (N\ _ _ 

(group of N patients) \nj \nj 
where the first term on the right-hand side is the cost of group testing, 
and the second term is the cost of testing n individuals in the expected 
number of groups that fail the group test. This equation may be mini- 
mized for a given value of p by variation of n. It will be seen that, if p 
is small, the optimum value of n will be large, and vice versa. To illus- 
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tote, suppose p = 0.1, c, = c, = 510, and » 

various choices oEn = 1,5,3, 4, and 5. 


= 100 . 


n 

1 

2 

3 

4 

5 


Group cost 
$1000 
500 
303 
250 
200 


Individual cost 

<10)(2){50){0.19) « 190 
(10) (3) (33) (0.27) = 270 
(10)(4)(25){0.34> « 340 
(10>(5)(20) (0.41) =410 


We now tabulate 


Total cost per 100 
$1000 
690 
603 
590 
610 


The lowest cost is §590 per 100, or $5.90 per patient, and represents a 
reduction in the cost per patient to less than 602 of the individual patient 
cost 11 This form of testing is suitable when we can exploit the constraint 
that all the individuals must be -mgative" for the group to be "negative, 
and when such a group test can be made without examining the in- 
dividuals. 


An oft-repeated logical problem provides another example of group 
testing in which the number of tests may be reduced to one, by the exploita- 
tion of constraints. 


Example: A sheik receives eight tribal representatives, each of whom 
is to pay taxes for his tribe. Payment is made in gold bars, weighing 
exactly one pound each, and each taxpayer places his payment in a 
separate pile before the sheik so that each tribe is identified by position. 
Before the gold bars are gathered together for the treasury, the sheik is 
informed that one (and only one) of the tribes has cheated by shaving 
exactly one ounce of gold from each bar tendered in payment. However, 
before he can reveal the tribe, the informer is murdered by an unknown 
assailant With this information available, however, the sheik can easily 
determine the guilty tribe. For this purpose, he has a scale calibrated in 
pounds and ounces, and in one weighing he is able to isolate the culprit. 
How }$ this done 0 

The solution to this problem exploits both the arrangement of the 
piles, which is one constraint, and the knowledge of the weights of the 
good and the defective bars which provide enough additional constraints 
to produce a unique solution Assuming that each pile contains eight or 
more bars, the sheik places on the scale one bar from the first pile, two 
bars from the second, three bars from the third, and so on for all eight 
piles The ounces of shortage indicate the pile containing the shaved 
bars, thus solving the problem in one test. This result is considerably 
better than the binary split procedure, because the system of gold piles 
lias been treated as a single eight-state problem instead of a series of 


'te ,U3S '° rh "«v ‘id i„ ^ 
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two-state problems, and because the facts of the case permit direct iso- 
lation of the one defective state in eight. 

To see the effect of the stated constraints, it is worthwhile to consider 
the procedures the sheik might follow if (1) it were known that only 
one of the gold bars in one of the piles had been shaved one ounce, or 
(2) it were known that some of the gold bars in one of the piles had 
been shaved one ounce each, or (3) it were known that there were one 
or more bars shaved by one ounce somewhere in the collection of bars, 
fn (1) and (2), a minimum of three weighings using the binary spht 
procedure would be required to isolate the defective pile. In (3) the 
number of weighings would depend upon the way in which the defective 
bars were distributed in the piles. (How many weighings would be 
necessary if there were exactly one defective bar in each of the eight 
piles— and this fact were not known beforehand?) 

Although the examples of this section have been chosen for their dra- 
matic value, group testing often occurs in practice before the investigator 
sets to work. For example, in a system of N components or blocks, each of 
which must work for the system to operate, knowledge of system malfunc- 
tion is essentially the result of a group test that failed. Similarly, when a 
system is subjected to a response test, that test is a group test, which for 
the tested input-output relation requires all the blocks in a given chain to 
be operative. For this reason, a preferred first-response test for a complex 
system is one that will require the correct operation of all the system blocks, 
because if this test passes, no further testing may be needed. 

When a group test fails, we may then employ a series of block or in- 
dividual tests as indicated by the failure symptoms or by the logical require- 
ments for isolation, as previously described. 


SEARCH WITH PARTIAL INFORMATION 

We now present two selected search problems that illustrate how partial 
information may be exploited to improve the search process. 


The Search (or a Unimodal Optimum 

If wc know that an optimum set of conditions for a system exists within a 
certain range, and that the relationship between a given adjustment and 
the optimum is “unimodal " i.e., that the adjustment will cause the system 
to rise steadily to its optimum condition, and that further adjustment will 
cause a steady decline from that optimum, we have extremely valuable 
information to aid in the search process.* 3 

» rj^ e nonna j cu rve, lor example, is unimodal, with the mode at the average. The normal 
curve falls away steadily on both sides of the average, and has no lower peaks. 
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For example, suppose a tptm’s output/input ratio (yirid) efti be 
varied by a change of a single parameter ( say, temperature) r. The rela 
sliin between i and the yield f(x) is not fauwn completely, but : S“PP“ C 
it is known that optimum adjustment of x lies within the range 0 — x » 
and that f(x) can be evaluated within the specified range^of x at selected 
points. Finally, suppose that the yield f(x) is “unimodal,” te„ there is a 
Single optimum x, such that x a is within or at the boundary of the interval 
0 to L, and that f(x) is strictly increasing lot x ^ x„ and strictly decreasing 
for x > x a {or else strictly increasing for x < x„ and strictly decreasing for 

*=* bl- 
under these conditions, the following steps ofier an optimum ejepen- 
mental search procedure, which we will first describe graphically, and then 
improve by more detailed specification.** 

Suppose /(*) looks like Figure 10~7a within the inten'al O^x^L. 
The curve shown is “unimodaT as required [which means there is only one 
hump or peak in the curve for /(x)}. Because the value of f{x) must, by 
this assumption, steadily decrease on either side of the optimum Xo, two 
initial measurements, x, and x, >x lf can produce one of the following 
three outcomes: 

/(* ,) > ji x t ) :The optimum must he to the left of x„ and the region of 
search can be restricted to the interval 0 to x t . 

f{x i ) < /( xi ) : The optimum must lie to the right of x, and the search can 
be restricted to the inten’al x, to L. 

f(*>) ~ t(xz). The optimum is in the inten'al x, to x*. However, to be 
consistent with what follows, we may restrict further search either to the 
interval 0 to x, or x, to L. 

The net result of the Erst two experimental measurements is to reduce 
the interval 0 to L, to one of the smaller intervals indicated above. Within 
that reduced interval, there will be one measurement that can he used in 
the nett step For example, in Figure 10-7a, the test shows the optimum 
is m the inters al 0 to x,. One added measurement in that inten’al, say 
0, x, ~x ? , wall allow the test to be repeated again, and so on thereafter, 
until the optimum x„ can be located within a narrow interval, which may be 
made as small as desired by continued experimental measurements. 

The abos c procedure will work if the location of the experimental mea- 
suranents is chosen arbitrarily. However, the following rules will provide 
an optimum selection of these points. 

Suppose that it is desired to locate the optimum x, within a narrow band. 
Then we can divide the interval 0 to L into a number of small segments 

?e aSSSt SsSsMM 
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10-7 Procedure for optimal unimodal search In one dimension. 


whose width equals that band or interval of isolation. Thus, if we make F 
divisions, the isolation interval width is L/F, and we can, if we chose, 
measure the length of L in those units, i.e., five equal intervals of length 
L/5, etc. When this decision has been made, we can consult the following 
table to determine how many experimental measurements n will be needed, 
to provide the required isolation. 

p. i i 2 3 5 8 13 21 34 55 .. . 

^ 0 T 2 3 4 5 6 7 8 9 . . . ' 
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For cample, suppose a systems output/iopot atio (yield) 
varied by a chaoge of a single parameter t say, temperature) x. Tfce reteMn 
strip between r and tie yield /(a) is not biown completely, but suppos 
it is known that optimum adjustment of r lies within the iange 0 — * “ 
and that fix) can be evaluated within the specified range of x at selected 
points. Finally, suppose that the yield /<x) is "unimoda!" i.e., there ts a 
single optimum x a such that x® is within or at the boundary of the interval 
0 to L, and that f(x) is strictly increasing for x =£ x® and strictly decreasing 
for x > x® (or else strictly increasing for x < r, and strictly decreasing for 


x5sx„). 

Under these conditions, the following steps offer an optimum expen- 
mental search procedure, which sve wilt first describe graphically, and then 
improve by more detailed specification.** 

Suppose /(x) looks like Figure lfh-7a within the interval 
The cun'e shown is “unimodar as required (which means there is only one 
hump or peak in the curve for fix)]. Because the value of fix) must, by 
this assumption, steadily decrease on either side of the optimum x®, two 
initial measurements, x» and t* > x u can produce one of the following 
three outcomes-. 


/(x, ) > fix* ) The optimum must lie to the left of x 2 , and the region of 
search can be restricted to the interval 0 to x 2 . 


fi ti ) < Hxt)- The optimum must lie to the right of x, and the search can 
be restricted to the interval x , to L. 


fix i > - /(xs ; The optimum is in the interval x, to x z . However, to be 
consistent with what follows, we may xestrict further search either to the 
interval 0 to x z or s, to L 


The net result of the first two experimental measurements is to reduce 
the inters a! 0 to L, to one of the smaller intervals indicated above. Within 
that reduced interval, there wall be one measurement that can be used in 
die next step For example, m Figure l&~7a, the test shows the Optimum 
u m the interval 0 to x- One added measurement in that interval, say 
0, x, will allow the test to be repeated again, and so on thereafter, 
until the optimum x„ can be located within a narrow interval, which may be 
maik as small as desired by continued experimental measurements. 

The above procedure will work if the location of the experimental mea- 
surements is chosen arbitrarily. However, the following rules wiU provide 
an optimum selection of these points. 

Suppose that it is desired to locate the optimum x 0 within a narrow band. 
Then we can dmde the interval 0 to L into a number of small segments 


"" tW ***** fw »*«* ** R- E. be&um a »d 
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10-7 Procedure for optimal unlmodal search in one dimension. 


whose width equals that band or interval of isolation. Thus, if we make F 
divisions, the isolation interval width is L/F, and we can, if we chose, 
measure the length of L in those units, i.e., five equal intervals of length 
L/5, etc. When this decision has been made, we can consult the following 
table to determine how many experimental measurements n will be needed, 
to provide the required isolation. 

F: 1 l 2 3 5 8 13 21 34 55 .. . 

n: 01234567 89... 
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In general, the relationship between n and F is 

F(») = F(»-l) + F(*-a). F(0)=F(1)*0. 

The latter tenmrement indicates that one experimental measmement does 
no more good than none in taring the optimum. (We 'f “L of 
explain the table momentarily. However, it may be seen that any value 
F in the table is equal to the sum of the previous two values x>\ r . 

Let us consider first the case of isolation into one of two intervals. 
(F — 2, « = 2). The problem is first to select the position of the two mea- 
surements required, x, and The rule for optimum selection of these two 
values is ss 

x, ~ F ,-2 and x, ~ K-i 


where for n — 2, the appropriate values of F are obtained from the table. 
In this case, the values for x, and x- from the table are both 1-, i.e., both 
measurements should be made at the midpoint of the interval. (The method 
attempts to divide the interval into equal parts in this case.) However, two 
measurements at the same point will not reveal anything about the location 
of the optimum m the first or second part! Therefore, suppose the trial value 
Cor X) is offset by a very small amount «, so that x, = 1 — « and x, = 1. With 
this slight modification, the two test measurements will indicate that the 
optimum is either m the interval (0 to i) or in the interval (1 ^ t to 2), 
where again we have measured * in the number of divisions of the interval 
0 to L. Because < can be made arbitrarily small, the problem is resolved by 
this device. ( It is important to understand this case, because if F is greater 
than 2, the final test always resolves itself into the one described above. ) 

To improve the precision o! the isolation, \vc may now ask for the 
interval 0 to L to be divided into F ~ 5 parts. From the table, n — 4. For 
the first tw o trial values 5 , and x, we have 


x, = F. , = 2 and x, = s= 3, 


where the interval count is based on divisions equal to % of h. Figure 10-7 
(c) illustrates the first two selected measurements, from which we may 
conclude that the optimum (for the case shown) hes to the left of x 4 and 
is m one of the three segments numbered 1, 2, and 2. 

We now have a three-part problem Again, apply the formula for the 
location of the neat measure moots. Foe tire three-part problem, measurements 
should be at location! 1 and 2. the latter already existing, e.g., sc, will be at 
location 1 Now we see the optimum is above s*. 


S B ^ f*™ 10 ** «* 107 sufigert rt, c optimality 

^ p,D ? f he fount! In Beikoan 3n a Dreyfus, op, (ft . p. 154. 
££ valuts of F ( h a, f ft, *>^15c4 FiboiUcd numbers that octu'r 
J”™ > l “ an eight-part problem ^iU go to a five-, then a tW-, then a 

t^-o-part problem, »cw<3mg to the Fibonacci series 
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We now again have a two-part problem, with an existing experimental 
measurement at the midpoint between intervals 2 and 3. The last measure- 
ment x, could then be made at that point, but offset by a small amount e, 
as before. Application of the test as usual will now isolate the optimum in 
one or the other interval, interval 2 in this case, within the error t. 

This procedure can be used for isolation to any degree of precision by 
increasing F, and always works for one-dimensional optimization. Although 
the series of values for F, which is a function of n, increases slowly at first, 
it increases rapidly for larger values of n; e.g,, F(n~ 20) is greater than 
10,000. Thus, the position of a maximum can always be located within 10* 4 
of the original interval length (0 to L) in at most 20 observations Xi to x 20 , 
selected as described above. For large values of n, F(n) is given accurately 
by 1.45(1.62)" for this procedure, 2 * which may be compared with the 
equilavent binary procedure for isolation when group tests can be made, 
i.e., F(n) =2". Thus, although the method described is not quite as good 
as the binary split method, it provides geometrical cuts in variety in many 
cases where the binary split approach is not practical. 

The problem of determining optimal search procedures for locating the 
absolute maximum of a function of N variables is very difficult, and (al- 
though we will sketch out one extension) further discussion is beyond the 
scope of this Chapter. 21 

Hill Climbing Methods 

The experimental determination of “optimum'’ system response usually 
involves a series of experimental perturbations, or changes, in system condi- 
tions in small steps. The steps are generated and examined sequentially. The 
changes that produce the greatest increase in response guide the investigator 
to a new set of conditions, and so on, until the optimum is reached. This 
method is known as “the technique of steepest ascent.” 

Example: Suppose the output 2 of a chemical process depends upon 
two variables: Temperature x and pressure y. The system response z = 
f(x t y) is a three-dimensional surface, or a two-dimensional contour 
map, as shown in Figure 10-Sa. 2 * The contours shown in this hypothetical 

* See Bellman and Dreyfus, op. cit., p. 155. The procedure described above can easily 
be extended to cover the location of an optimum when x can take on only integer 
values, as shown by Bellman and Dreyfus on page 156. 

>* Bellman and Dreyfus, op. cit., provide selected references to this point, pp. 178-179. 

For a survey of experimental procedures, see O. L. Davies (Ed.), Design and Analysis 
of Industrial Experiments, Hafner, New York, 1954, Chapter 11. 

* Compare Figure 10-8a and 10-8b with. Figure 7-8. The contour map also may be used 
to show three variables, when drawn in three dimensions. A number of interesting 
experimental surfaces are illustrated in this way in O. L. Davies, op. cit.. Chapter 11. 

For a popular expository article with a bibliography of similar search methods, see 
j. m. idelsohn, “Ten Ways to Ftnd the Optimum,” Control Engineering, June, 1964. 
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10-5 Contour maps of a response surface tlt-ustraling the steepest response, or hill 
climbing, method of search for optimum conditions, (a) has a single optimum. Con- 
tours are shown as a percent ot the true optimum Vwttitb UTfavewn in genets*). Search 
U confined to the "region o! Interest," selected in advance from previous knowledge. 
With the single optimum, the method of steepest ascent leads sequentially to the opti- 
mum With the multiple optima of (b), the terminal point depends upon the starting 
point, and levetil starting points must be teed to isolate the global optimum. 


tut iace connect points of equal yield on tVie response surface, and are 
marked in percentages of the optimum (which is within the 90S contour 
and not shown}. Although the surface is shown for illustration, its shape 
and the location of the optimum are not known, and experimental 
observations axe expensive, so that extensive systematic measurements in 
the region of interest (limited to the rectangle bounded by x 0 and y 0 ) are 
not feasible. (If observations were inexpensive, we could systematically 
make a gnd of observations or take a large number of random observa- 
tions in the region of interest and develop Figure 10-fia in this way.) 

To start the steepest ascent method, assume that an experiment consist- 
ing of four observations about the point P has been conducted. If the error 
of measurement is small compared to the slope of the response surface, the 
direction of maximum ascent can be estimated from fee experimental points, 
suggesting an area around point Q for further investigation. The experiment 
is then repeated about point Q, and so on up the hill. In general, as such 
a sequence of experiments continues and the investigator comes near the 
top of the hill, the surface will tend to flatten out, or hump over and call 
for more careful determination of the path of steepest ascent. (Typically 
linear estimates of the path are made initially. Upon coming near the 
optimum-a near stationary point”— higher-order estimates, such as para- 
bolic path estimates, &Te used. The computational details of this procedure 
are ghen by Davie* m the references cited.) 
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The liill climbing or steepest ascent method works for continuous re- 
sponse surface With a single peak. Should there be several peaks, the 
terminal point will depend upon the starting point of the search, so that 
a scries of starting points is often used when multiple peaks are suspected, 
Figure I(k-5(b), and the maximum of the maxima found by this method is 
taken as the global optimum. 1 * 


dynamic programing AS A SEARCH APPROACH 

To conclude our selected search examples, we turn now to the procedure of 
dynamic programing, 10 which also seeks an optimum in stages. 

There arc many systems optimization problems that may be treated 
cither as one search effort involving man)- combinations or, alternatively, 
as a scries of search problems, each involving a smaller number of combina- 
tions, In many cases, the latter choice is beneficial in reducing the computa- 
tion and evaluation involved in finding the optimizing conditions. 

Example: Suppose a lost object is located in one of three areas, labeled 
1, 2, and 3. The probability that the item will be found in a given area 
is given in the table below. Suppose that the cost of making an observa- 
tion varies from area to area; this cost is also shown below. In addition, 
suppose that the probability of finding the object (if it is in a given area) 
is 0.1 per observation; i.c., detection is not certain, but will improve as 
the number of “glimpses" increases. Jf a total of $30 is available for 
search, how should the glimpses, or observations, be allocated to the 
three regions to maximize the probability of finding the lost object? 


Area — i 



1 

2 

3 

Probability pi that object is in area i 

0.5 

0.3 

02 

Cost per glimpse Wt 

$10 

$ 5 

$ 3 

Glimpses allocated 

n, 

th 

»j 

Resources allocated n, Wi 

n t wi 

n 2 w 3 

n s w 3 


* The philosophy of taking 4 route to the optimum Out follows steps of ever-improving 
response (Increases in the objective function of Chapter 8) is adopted in a number of 
search methods. For example, it is used in linear programing. Figure 7-4, which, with 
hoear constraints and a Unear objective function, leads to the optimum if it exists. 
Similarly, searches through decision trees may proceed in this way, although several 
alternative starting points or initial search directions are usually required. For an 
illustration see Egon Balas, "An Additive Algorithm for Solving Linear Programs 
with Zero-One Variables” Operations Research, 13, 4, July-August, 1958, pp. 517-545. 

* See R. E, Bellman and S. E. Dreyfus, Applied linear Programing, Priacetoa Univer- 
sity Press Princeton, N.J., 1062, especially Chapters 1 and II. 
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If ,ve consider area J, the detection probability is gi'en Vr 
i te tW item is in a given region p. times the probab.hty that 
the item will be detected ii it is in region i, and n, glimpses have been 
Ste. The latter probability is given by < I ■ -Of >, 1 ”*>“ 

the probability that n, glimpses svffl tail to detect the : obire , 
present. Thus, the probability of detection m region r ivith m glimpses 

P,(n.) = (p.) (1 - 0.9-). 

(There are three such equations, svith i = 1,2,3. representing the situa- 
fcion in each region.) 

The problem may now be formally stated as 

Maximize £ f*i( n i) ( 1 *= 1,2,3) 

Subject to ^ n,te< — IV (i = 1,2,3) 

where \Y is $30, the total resource available. These two equations simply 
say that we should obtain the highest detection probability possible for 
the three areas as a whole, but that we cannot spend more than the 
total resources available, $30. Because the lost abject must be in cae 
and only one of the three areas, they may be considered independent, 
and the probabilities add, as shown, presenting a typical resource alloca- 
tion problem, which arises frequently in systems analysis and design. 

Before continuing, it is worthwhile to find the maximum number of 
glimpses that could be bought in each region if the total resource were 
spent there Division of the total available by the cost per glimpse for 
each region ( W/u ., } gives this result as a maximum of three glimpses 
m Area l, six glimpses jn Area 2, and ten glimpses in Area 3. 

One way to proceed with the solution would be to enumerate all 
feasible combinations of the n, values, vvuhin the $30 limit, evaluate the 
total detection probability, and select the combination of n t that gave 
the largest detection probability for $30. Some thought will reveal, Bow- 
es er, that even for this simple three-area problem such a tabulation 
would be extensive, and that for a many -area, large- total-resource 
problem, the table could get out of hand entirely. 

We therefore think of attacking the problem in stages, first consider- 
ing Area 1, then Area 2, and finally Area 3. To do this we will consider 
our specific problem as a special case of a more general class of alloca- 
tion problems. 1 c., imbed* our problem in the larger class, by assuming 
that the total resource available is not fixed, but may vary from 0 to W. 
When this is done, the detection probability m a given area will be a 
function of the resource available (because whatever the resource may be 
it lmuts the glimpses that can be made). With this idea in mind, we 
create a tabulation in which we first consider Area 1 only and compute 
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the probability of detection for various resource levels from 0 to VP. 
(In Table 10.1, only steps of $10 have been shown for simplicity.) 


Table 10.1 Stage 1 


w 

Possible fii 

f, = Pt= (0.5) (1 - 0.9-i) 

0 

0 

0.000 

10 

1 

0.050 

20 

2 

0.095 

30 

3 

0.135 


We stop tabulating W at $30, because this is the total resource available. 
For future reference, we call the one-stage probability of detection 
for each value of W. The value f t is the maximum detection probability 
that can be bought with the W available at each increment in Area 1. 

Turning to Stage 2 of our computations, we consider the effects of 
allocating various levels of the total resource W to the two areas { Area 
1 and Area 2) in an optimal way. For clarity in this computation, we 
show in complete detail all figures in Table 10-2, but we note that, in 
fact, only the lines with an entry in column (6) are of any interest. 
We allocate a given level of W to the two areas, considering Area 2 
first. The number of glimpses allocated to Area 2 is progressively in- 
creased. Any left-over resources are allocated to Area 1, and we pick 
up the largest or maximum detection probability available to us from 
that allocation to Area 1, namely, /,(W) from Table 10.1. Finally, in 
column (6) Table 10.2, we show for Area 1 and Area 2 the maximum 
detection probability possible for the specified level of \V. This process 
continues until W = $30, when the tabulation ceases. 

In Table 10.2, column (1) shows the level of W allowed as we step 
to higher values. These steps are now in $5 increments, because the cost 
per glimpse is $5 in Area 2. Then, in column (2) we show all possible 
values of n 2 , from 0 to the maximum possible. In column (3) we show 
the resources left over (W — rhto t ). Column (4) shows die detection 
probability in Area 2 for the value of n 2 specified on the same line. 
Column (5) shows the detection probability in Area I, which is the 
maximum possible for the resource left over (VV —n 2 w 2 ). Finally, column 
(6) shows the sum of columns (4) + (5), which is the maximum for any 
level of the total resource W. In short, the entries in column (6) repre- 
sent the maximum detection probability available if the total resource 
\V is allocated to Areas I and 2. For future use, we may discard ah 
lines in Stage 2 except those with an entry in column (6), and produce 
a "Condensed Stage 2" table, which more clearly illustrates this fact 
(see Table 10.3). The notation f? indicates the maximum two-stage 
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Table 10.3 Condensed Stage 2 
(Shows Essential Features of Table Above) 


w 


n 2 

u 

0 

0 

0 

0.000 

5 

0 

1 

0.030 

10 

0 

2 

0.057 

15 

0 

3 

0.081 

20 

1 

2 

0.107 

25 

1 

3 

0.131 

30 

1 

4 

0.153 


detection probability as a function of the level of XV. The optimum 
values of th and n t (from Table 10.1) are shown for reference. 

Finally, turning to the problem of Area 3, we may consider the Stage 3 
tabulation. We proceed just as before. Now, however, we may set IV = 
$30, because this is the last stage. Our objective in the tabulation is to 
find the optimum allocation of $30 to Areas 1, 2, and 3 (the original 
problem). This may be done by first concentrating on Area 3, and pro- 
gressively increasing n 3 from 0 to the maximum value of 10. XVhalever 
resource is left over at each increment (IV — n 3 tv 3 ) is then given maxi- 
mum use in the two-stage allocation (according to the values of XV and 
/ z shown in Table 10.3). 


Table 10.4 Stage 3 tor W = $30 Only 


(i) 

(2) 

(3) 

w 

(5) 

(S) 

W 

n 3 

(W — rtjtOs) 

P s = 

(0J2)(l- 0.9’*) 

ft (XV — n s w 3 ) 

/, - max(P,(n,) 

30 

0 

30 

0.000 

0.153 

- 


1 

27 

0.020 

0.131 

— 


2 

24 

0.038 

0.107 

— 


3 

21 

0.054 

0.107 

— 


4 

18 

0.069 

0.081 

— 


5 

15 

0.082 

0.081 

0.163 


6 

12 

0.094 

0.037 

— 


7 

9 

0.104 

0.030 

— 


8 

6 

0.114 

0.030 

— 


9 

3 

0.123 

0.000 

— 


10 

0 

0.131 

0.000 

“ 


It is extremely important to note that what we are about to do in 
Table 10.4 requires reference only one tabic beck, namely, to Table 10.3. 
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Foe our present purposes the original Table 10.1 is not needed. In Table 
10.4, column (1) is W = $30, column (2) shows increments of n., 
column (3) shows (W - r,#? 5 )» and column (4) shows P,(n s )- Column 
(5), which shows /,(1V ~ **<*«>» h Stained by setting W = W - n*ti>s 
in Table 10.3 and reading off the corresponding value of Column (6) 
shows the value of U « max lP,(n s ) +/, (W - n*u>,)3 which is the 
maximum detection probability that can be obtained by allocation of 
$30 worth of glimpses to the three areas. 

The solution to the problem posed is that, lor $>30, a maximum detec- 
tion probability of 0.163 may be obtained by the allocation of 5 glimpses 
to Area 3, which is evident from Table 10.4, and 3 glimpses to Area 2, 
which may be obtained by referring to Table 10.3. (The value of n* ~ 3 
corresponds to the $15 of resource remaining after $15 has been spent on 
the 5 glimpses in Area 3.) N’o glimpses are allocated to Area 1, because 
no funds remain after the previous optimum expenditures. 51 


Table 10.5 Summary Table tor Stage 3' 


w 

th 

rtj 

fls 

f. 

20 

0 

1 

5 

0.112 

21 

0 

3 

2 

0.119 

22 

0 

2 

4 

0.126 

23 

1 

2 

I 

0.127 

to 

0 

3 

3 

0.135 

25 

0 

2 

5 

0139 

2 Q 

1 

2 

2 

0 145 

27 

0 

3 

4 

0150 

28 

0 

2 

6 

0 151 

29 

1 

2 

3 

0161 

30 

0 

3 

5 

0183 


For 428 an alternate com- 
bination will abo give f>=zO J5l, 
namely, n. =a J. n, «3, n, =s l ) 


»DrW Sr„ d r“ f “"f ****»■ The same method would 

" '“.“"S” 1 problem had provided any number ol areas (or if 

another optlnurahon problem could have been formulated as a series rf 

to 1 wftefcSiiflf «, (lOT «° 

KlimfKei to t},* thtte g rBU varjc, wl >. * Jl£w *he allocation of 

tlut frwjwntly ,bmpt »h!£u fa, allocation ‘H* 0 ** 1 *«°ure« available. Note 

Ul. ,WM», e J u» "»»» 'P O .^ced. 

ovnh for oocU pulioW) C,S, ,h^ ' taral <e.g , 

otto no, be tourf o ra «‘“ltty of .hup, .tail, to alia. 

<4 I TOanatioa (e s , nurlet efet, 0 f w„ 
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small searches rather than one grand one). This may be seen by noting 
how the original optimization problem can be reformulated in the stage-by- 
stage approach. Note that the column designations of the Stage 3 Table 
(Table 10.4) are the same as those of the Stage 2 Table, (Table 10.2) with 
the index i advanced by 1. \Ve could thus state the original problem in the 
functional equation form a! 

f ( (W) = Max [Pi(w'i) + /,_,( W-ttf )], 

0 w 

where tc\ — n,w, (i ~ 0, 1, 2, 3, . . . ,k ) and f 0 = (0). This is a multistage 
search problem. To find fi we must have already found f f . u the result of a 
previous lesser-stage problem. Equation fi(W) is a so-called recursion 
equation for this reason. (The recursion starts with / 0 (W) =0.) 

The procedure illustrated is also efficient for numerical computation on 
electronic computers (it is useful in obtaining a specific solution to a specific 
problem without wasting memory space), because only the preceding table 
need be consulted to form the tabulation for a new stage, as illustrated in 
our example. The procedure may also be used for approximate solutions to 
problems involving continuously variable resource allocations, or continuous 
functions, illustrated here by the discrete probability of object location p„ 

Many additional advantages of this approach and computational tech- 
niques of its use are described by Bellman and Dreyfus in the reference 
cited, which provides substantial bibliographies for added reading. 

In conclusion, this example has illustrated Bellman’s Principle of 
Optimality, which has been stated as follows. 

Principle of Optimality 

An optimal policy has the property that, whatever the initial state and 
initial decision are, the remaining decisions must constitute an optimal 
policy with regard to the state resulting from the first decision. 33 

In our example, this principle states that for a given level of W in the 
three-stage problem, whatever decision is made about the level of th, an 
optimal policy must result in a “best” allocation for the remaining re- 
sources in Areas 1 and 2. This optimal policy is expressed in the functional 
equation last presented, and in the final result obtained for our problem. 
Thus, if $15 is allocated for five observations, or glimpses, in Area 3, then 
the optimal policy for that initial commitment must be to allocate optimally 

” A functional equation contains another function In this case f. depends on f, • 

“See, for example. Bellman and Dreyfus, op. cit., page 15. For additional problems in 
Dynamic Programing, see M. Sasieni, et a!.. Operations Research. Methods and Prob- 
lems _ Wdey, New York, I960. An understanding of the technique is enhanced by a 
Survey of the applications ated by Bellman and Dreyfus, which include multistage 
missile launchings, etc. 
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the remote (V15) to Areas 1 and 2, as *« done to obtain the optimal 

me AlftooghTe have made our computations from a one-stage to a three- 
stage problem (or, in general, through k stages), vse invoice the PAraple 
of OpUmaUty at each new stage, so that it U invoiced for the total Jc-stage 
system. 


problems 

yty 1 Signal tracing Is probably the most frequently used diagnostic technique 
when probabilistic or logical metbods-whfch cxruld relate symptoms to 
causes— fad to produce the required results, or when such methods are 
difficult or impossible to apply owing to insufficient data or other reasons. 
A List applications of the signal-tracing technique that are familiar to 
you. Imagine other useful applications. You need not restrict attention to 
a particular type of system; the method is general. 

R. An experiment is conducted in which a large group of subjects is 
split into two randomly selected smaller groups of equal size. A 15-minute 
lecture on signal-tracing techniques is given to subgroup 1; subgroup 2 
is left alone to read Life magazine for the same time period. Then both 
groups are asked to solve a diagnostic problem. In t very complex train 
of gears and shafts, one or more set-screws are loosened. Each subject is 
asked to find the loose gears. The solution time is then recorded for 
each subject By what ratio would you predict the average solution times 
to differ between groups 1 and 2? 


10 2 


An insurance firm has 200,000 annuity holders. Each month the file must 
be updated because collections, dividends, payments, terminations and 
additions, and payments on death affect the account balances of these 
individuals 


Suppose the annuity records are filed in numerical order from 000,001 
through 2CO.OOO To assure accurate updating, over-all control balances 
are computed for each block of 1000 individual accounts by computer, 
and separately, by hand, for the over -all list. The updating is assumed to 
be correct if the total balance of a block of 1000 records is within plus 
or minus 1 10 of the computed control total. In addition, the grand total 
computed from the detail is required to be within plus or minus $1.00 of 
the band-computed grand total. <The account variation cited in the 
check procedure is to allow for modest rounding-off errors introduced 
tiy toe compulation system in figuring the individual account balance.) 
A. What do you think of this checking system? 

1 } X ““moral mile, of the pnjjwsrt Hm g sy)tcm 
, .‘i 4 '"*" “ “" J f™> «« t«? WouM it miW much 

it there nrec out oaefly 1000 Individual, in a |„t btai? 
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C. If the balances do not check as required, the firm wants to find the 
account, or accounts, in error and make the appropriate corrections. How 
would you search for the error, or errors, in an efficient manner? 

10.3 An insurance company has a master file containing the names of 200,00 0 
policyholders arranged in alphabetical order with last name first. The 
insurance contracts for these individuals are filed by contract number, 
and one individual may have a number of policies for different types of 
insurance. 

A system is to be developed to answer rapidly mail and telephone in- 
quiries regarding a given individuals policies. Thus it is necessary to 
obtain an accurate cross reference between the alphabetical file and the 
contract serial file. 

Unfortunately, several problems arise. First, an individual making an 
inquiry seldom if ever remembers his policy number, or numbers. Second, 
Several individuals may have the same name and initials; e.g., the New 
York Manhattan telephone directory has 23 Juan Rodriguez listings with 
no initial and many more with one or more initials, To make matters 
worse, inquiries often are made with names at variance with the records 
(Juan S. Rodriguez may inquire simply as Juan Rodriguez, or vice versa), 
and indeed the original contracts for the same individual may have been 
written with a slightly different description (the policy may be [or J. S. 
Rodriguez). Other problems of misspelled names, phonetic misunder- 
standing, name changes due to marriage, divorce, or legal action add to 
the ambiguity of positive identification. 

Even though it would be possible to identify a given individual (and 
his contracts) uniquely vising subsidiary invariate information such as 
sex, date of birth, and place of birth, variable information such as present 
address and historical addresses, and added information such as social 
security number, the appropriate record or set of likely records must be 
physically found in the file before such subsidiary information can be 
checked to obtain a unique verification. 

In most information storage systems, the physical location problem 
required for initial screening can be a lengthy process, because the likely 
candidates for further check may be widely dispersed in the alphabetical 
file. 

For example, the Manhattan telephone directory has seven J. Rodri- 
guez listings, which are separated from the Juan Rodriguez listings by 
over 100 other Rodriguez possibilities ranging from Jacinto to Josephine— 
and extending thereafter through Juana to Justina. Moreover, the records 
or the inquiry may have the name spelled Rodriquez, which is several 
hundred listings later than the original Juan Rodriguez, or the spelling 
may have been given as Rodrigues, Rodragues, Rodreguez or Rodrigeuz, 
all of which are several hundred listings before Juan Rodriguez. In all, 
well over 1000 possibilities separate the first from the last reasonable 
possibility in the single telephone book cited, with many irrelevant listings 
in between. 
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system. 


PHOBIEMS 

10.1 Signal tracing is probably the most frequently used diagnostic technique 
when probabilistic or logical methods-vvhich could relate symptoms to 
causes-fad to produce the required results, or when such methods are 
difficult or impossible to apply owing to insufficient data or other reasons. 
A List applications of the signal-tracing technique that are familiar to 
you. Imagine other useful applications. You need not restrict attention to 
a particular type of system; the method is general- 

B An experiment is conducted in which a large group of subjects is 
split into two randomly selected smaller groups of equal size. A 15-minute 
lecture on signal-tracing techniques is given to subgroup I; subgroup 5 
is left 3lone to read Life magazine for the same time period. Then both 
groups are ashed to solve a diagnostic problem. In a very complex train 
of gears and shafts, one or more set-serews are loosened. Each subject is 
asked to find the loose gears. The solution time is then tecorded for 
each subject By what ratio would you predict the average solution times 
lo diSer between groups 1 and 2? 


10 2 


An insurance firm has 200,000 annuity holders. Each month the file must 
be updated because collections, dividends, payments, terminations and 
additions, and payments on death affect the account balances of these 
individuals 


Suppose the annuity records are filed in numerical order from 000,001 
through 200,000. To assure accurate updating, over-all control balances 
are computed for each block of 1000 individual accounts by computer, 
and separately, by hand, for the over-all list. The updating is assumed to 
be coned if the total balance of a block of 1000 records is within plus 
or minus S 30 of the computed control total. In addition, the grand total 
computed from (he detail is required to be within plus or minus $1.00 of 
the hand-computed grand total. (The account variation cited in the 
check procedure is to allow for modest rminding-off errors introduced 
by the computation system in figuring the individual account balances.) 
A. What do you think of this checking system? 

' h ' n " mwi “ 1 of the proposed Stas system he 
£ *'■£*- *° “ d 'M'lfom from die list? Would it make much 
tMerence If there were not erartiy 1000 todwiduils to a test Hook? 
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C. If the balances do not check as required, the firm wants to find the 
account, or accounts, in error and make the appropriate corrections. How 
would you search for the error, or errors, in an efficient manner? 

10.3 An insurance company has a master file containing the names of 200,000 
policyholders arranged in alphabetical order with last name first. The 
insurance contracts for these individuals are filed by contract number, 
and one individual may have a number of policies for different types of 
insurance. 

A system is to be developed to answer rapidly mail and telephone in- 
quiries regarding a given individual’s policies. Thus it is necessary to 
obtain an accurate cross reference between the alphabetical file and the 
contract serial file. 

Unfortunately, several problems arise. First, an individual making an 
inquiry seldom if ever remembers his policy number, or numbers. Second, 
several individuals may have the same name and initials; e,g., the New 
York Manhattan telephone directory has 23 Juan Rodriguez listings with 
no initial and many more with one or more initials. To make matters 
worse, inquiries often are made with names at variance with the records 
(Juan S. Rodriguez may inquire simply as Juan Rodriguez, or vice versa), 
and indeed the original contracts for the same individual may have been 
written with a slightly different description (the policy may be for J. S. 
Rodriguez). Other problems of misspelled names, phonetic misunder- 
standing, name changes due to marriage, divorce, or legal action add to 
the ambiguity of positive identification. 

Even though it would be possible to identify a given individual (and 
his contracts) uniquely using subsidiary invariate information such as 
sex, date of birth, and place of birth, variable information such as present 
address and historical addresses, and added information such as social 
security number, the appropriate record or set of likely records must be 
physically found in the file before such subsidiary information can be 
checked to obtain a unique verification. 

In most information storage systems, the physical location problem 
required for initial screening can be a lengthy process, because the likely 
candidates for further check may be widely dispersed in the alphabetical 
file. 

For example, the Manhattan telephone directory has seven J. Rodri- 
guez listings, which are separated from the Juan Rodriguez listings by 
over 100 other Rodriguez possibilities ranging from Jacinto to Josephine— 
and extending thereafter through Juana to Justina. Moreover, the records 
or the inquiry may have the name spelled Rodriquez, which is several 
hundred listings later than the original Juan Rodriguez, or the spelling 
may have been given as Rodrigues, Rodragues, Rodreguez or Rodrigeuz, 
all of which are several hundred listings before Juan Rodriguez. In all, 
well over 1000 possibilities separate the first from the last reasonable 
possibility in the single telephone book cited, with many irrelevant listings 
in between. 
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A number of solutions to this location problem are in general use: 
a search for phonetically similar names, a search in a severaHesd hierarchy 
If Z^L »i* increasingly reW (or ti C htened) contoons drc- 
tionarieS of alternate spellings or biciy possibilities. Htmnver, each ap- 
plication presents special problems, and a general optimum search pattern 
to locate an individual alphabetical name listing under practical conditions 
has not been found. 

A Suppose the insurance company has its alphabetical file stored in blocks 
of N names. You may consider the "block'* as a physical tray of N index 
cards. (Some of these index card locations may be empty'.) Or, the physi- 
cal form of storage may be of another “random access'* form, such as a 
magnetic card, or a track of N records on a magnetic disc or drum. Sup- 
pose the time ot cost of selecting and “withdrawing" the information in a 
given block is relatively large, but the cost of checking the names and 
subsidiary information iriihift the block, alter its selection is relatively 
small. (Yhis » the case \n many computer applications.} What arc some 
of the considerations that would determine the size of the block of N 
names and the first choice of the block for initial inspection? What alterna- 
tives would you suggest if a search of the first block selected fails? How 
would your answers be affected if the alphabetical file contained 
20,000,000 names instead of 200,000? 

B. How might you proceed in your search if the alphabetical index were 
stored on some nonrandom access medium, such as magnetic tape, which 
Idee a scroll requires strictly sequential search? 

C How would you update the alphabetical master file as new insurance 
contracts were sold? How would this affect your previous decisions and 
comments? 

D VVhat applications, other than the insurance search cited, raise similar 
questions? Under what conditions would an alphabetical search for a name 
be less severe than in the case of the insurance problem cited? 

E Why is the location problem illustrated here more difficult than the 
matching and location of names in the Amalgamated Dating Service case 
of Protect D? 

F How would ) our storage scheme and search process change if the name 
records were of vamb e length instead of fixed length, so the number of 
names in a block would be a variable, too. Instead of a constant? 

^ 4 m t B ‘ ^ r ?' r ta a 2 X 2 matrix. A lost object 

is known to be in one of the four boxes. 

*»?* * the probability of 

^ rmd® T » 1 th * **** * l«*irig in the boxes 

S e IrZ * r and ,f on the second loot the selection is again 
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B. What is the probability of finding the object in I, 2, 3, and 4 looks i( 
a box, once selected by the above method, is never inspected again if it is 
empty? (This corresponds to sampling without replacement.) 

10.5 Using the same four boxes as in the above problem, suppose the lost 
object moves between A, B, C, or D at random between each look. 

A. What is the probability that the object will be detected in N looks if 
the search consists of arbitrarily selecting a box and remaining there until 
the lost object arrives? 

B. What is the probability that the object will be detected in N looks if 
the search proceeds at random (with replacement)? 

C. What is the probability that the object will be detected in 1, 2, 3, 
and 4 looks if the search proceeds without replacement (the same box 
never looked into again if found empty originally)? Continue to assume 
the lost object may move to any one of the four boxes between looks with 
equal probability. 

10.6 Using the methods and data of the two-stage search for a valuable prize 
given on pp. 2S&-92, find the cost difference between the two-stage search 
method and the random search and acquisition method described if the 
reliability of the inexpensive search were increased to p = 0.5, where p 
is the probability of obtaining a true positive signal per look if the prize is 
present. Assume that the probability of obtaining a false positive signal 
per look is A = 0.1 per look, as in the example given, and that the search 
cost figures are as before. 

10.7 If, in the above problem, the reliability increase to p = 0.5 also increased 
the cost per look of the inexpensive first-stage look from $10,000 to 
$50,000 per look, would that increased cost be justified over the less 
reliable method (when p = 0.2)? 

10.8 From the results of the previous two problems, how large must the 
consolation prize be before the random search and acquisition method is 
as inexpensive as the two-stage search with p = 0.2 and p = 0.5? 

10.9 Ten prospectors are available to search for ore by one of two methods 
Aj and A 2 - At least two men must use Method A u and at least one man 
A 2 . In each case the search methods are such that the probability of 
finding ore by either method alone is given by the numbers in the table 
below. The total effort will be successful if either method is successful. 

A. What is the optimum allocation of manpower to the two methods to 
maximize the probability of finding ore with the resources we have? (In 
the table R t (x,) is *be probability of success when x men, O^x^lO, 
are assigned to Method 1, etc.) 
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tabu: I 

I 2 3 4 5 6 7 8 9 10 MM 

IS — TTTTT~7~T ~T~» ■» 

IStSi rTTTls JS8 ^9 .60 •** 

B. CotH the same problem be solved by OTOIber method? I! so, soSgBt 
wh at it might be. 

10 10 A ship is to be loaded to its maximum capacity, -which is 100 weight 
units. The following commodities, with their weight and value (pro&t) 
to us per unit as shown in the following table, ate available tor loading. 
We want to maximize the profit of the load, subject to the total weight 
constraint. 


Commodity l 

Weight tct 

Value to ■ 

1 

49 

20 

% 

50 

75 

3 

51 

102 


Although the obvious answer (for a feasible solution) is to use one unit 
of commodity 1 and one unit of commodity 3 (to obtain a weight of 100 
units) for a profit of 122 units, the true optimum is 150 units of profit, 
obtained by loading 2 units of commodity 2. 

A Solve this problem using dynamic programing to achieve the optimum 
of 150 units of profit. Assume for the purposes of this problem that there 
are no other constraints but total weight. 

B Would any change in your procedure be required for a larger number 
of commodities? 

C What changes in your procedure would be required if there were a 
total cubic footage restriction, assuming the cubic footage of each com- 
modity per unit were given? 

D Why is this a search problem? 

E Can ) ou thinlc of any other problems that would have the same pattern 
of definition and solution? 

(Adapted from R E Bellman and S E. Dreyfus, Applied Dynamic Pro- 
gramming Princeton University Press. Pnnceton, N.J., 1962, pp. 2S-32, 
where the solution may be found.] 


A system consists of three components in series, such that for the system 
to wort, each of the three components must worV. A proposal has been 
made to increase the reliability of the system by placing additional units 

w m , =>« to s ra-poreoB. In die 
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from experimental data obtained from an extensive life test to 1000 hours 
for a large sample of components. This number may be considered the 
reliability of the component to 1000 hours operation. The cost of each 
component is also given in the table. Using the method of dynamic pro- 
graming 

A. What is the maximum reliability configuration that can be obtained for 
the system if the total cost of the system is not to exceed $50.00? What 
is the system reliability (survival probability) to 1000 hours in this case? 

B. What is the minimum cost configuration that will produce a system 
reliability of 0.90 or greater? What is that minimum system cost? 

Component 


12 3 

0.G 0.8 0.9 Survival probability for individual components 

$5 $10 $15 Cost of each component 

(Note: for consistency of notation, let n< be the number of parallel com- 
ponents of type i, pi(ni) be the survival probability for the group of parallel 
n, components, let the cost of each component be wi, and the total system 
cost be W. Abo note that there must be at least one of each component 
type for the system to work.) 


PROJECT 6 

The Amalgamated Dating Service, Inc. (a fictional firm) seeks to match the 
characteristics of male and female students who want dates. A databank of one 
million names and coded characteristics is on hand for this purpose. The system 
consists of two major files: (1) A serial file and (2) an alphabetical file of names, 
addresses, and telephone numbers-plus serial numbers for cross reference to the 
characteristic serial file. 

Suppose the serial file has been ordered first into males and females, then into 
50 geographic categories. It is assumed that feasible matching is restricted to 
members of the opposite sex who live in the same area. The remaining detail on 
each name in the serial file consists of a five-digit serial number (to be used later 
after matching to extract selected names, addresses, and phone numbers from 
the address file), and 50 characteristics, each coded as a decimal position. Thus, 
each characteristic may have up to ten attributes, as indicated by the decimal 
numbers 0-9. Twenty-five positions show descriptive characteristics; twenty-five 
show desired characteristics. 

In summary, File (1), the serial master-file, consists of a decimal code of 58 
positions: Sex (1), geographical location (2). serial number (5), descriptive 
characteristics (25), and desired characteristics (25). The serial file is ordered 
by sex, location, and serial number, with sex the major key. 

File (2), the name-address-phone file, is arranged in sex-location-serial se- 
quence, with sex the major key (as in File (1)). 
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Tie initial procedure for matching is to take ail incoming rcquests nWA 
. „ « t, een -o^ed by the respondent in the fashion above, and order them m 

sex-location order. The company guarantees at least five names to each 
individual; therefore, the next step is to make a matching computaUon resfric^d 
to members of the opposite sex in the same location. Thus, further attention will 
be restricted to two blocks of individuals: the first block the requesting group, the 
second block the individuals in the master aerial file. The sex of the two blocks 
wall be different, but the location will he the same. Obviously, this restriction 
greatly reduces the number of possible matches that must be considered. 

To be specific in what follows, suppose that a requesting party is male, so 
that only matches with females in the same area wffi be considered. Also suppose 
further that the Eve required names will be those females who have the highest 
matching scores (or number of matches) as defined as follows: 

First, compute the number of matches between the last 25 characteristics of 
male code (desire) and the first 25 characteristics of the females (description). 

A match Will occur if and only if the two decimal digits in columns indexed as 
above we equal. 

Second, in addition to the score computed above, add to that total the num- 
ber of matches computed by using the last 25 positions of each female's code and 
the first 25 positions of the male code. 

This grand total will then represent the total number of mutual matches, 
which may be used for ranking females for selection for the male request. (The 
computation, of course, assumes that the ord-er of the 25 descriptive characteristics 
and the 25 desire characteristics is identical so that the computation just proposed 
will be meaningful.) 

The process of making the matching computation will be carried out for each 
male request. A similar procedure is to be carried out for each female request 
after the males have been completed. 

A Devise an appropriate detailed procedure for making the matching computa- 
tions and draw a block diagram of your method. 

B If some of ihe characteristics are thought to be mote important than others, 
how could this be taken care of? 

C Because you only want the highcst-score Sve names for each request, can 
you dense to added procedure to save only the serial numbers of the top Bee 
female scores found for each male request, and vice versa? 

D How Would the computing and memory facilities available affect your pro- 
p»ed computing and ranking scheme,? For emmple, would you lie to have . 

te'o.rSt nfmS^luhT *T* nOT °' V 0r 1,0,7 ™““ "™ld 

££ ^ storage? 1 ^ *" ' h ' «"* a give 
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Male 

sex/location/serial 


Female 

sex/location/serial 


xxxxxxxx 

xxxxxxxx 


i 


Ascending 

order 


xxxxxxxx xxxxxxxx xxxxxxxx xxxxxxxx xxxxxxxx 

xxxxxxxx xxxxxxxx xxxxxxxx xxxxxxxx xxxxxxxx 

Ascending order -» 


That is, for each male, handled in serial number order, you have also obtained 
the five top females, and have their serial numbers in order, too- With this set 
of data, you are now ready to print letters of introduction for each male. Can you 
suggest a method of doing this in an efficient manner? YVhat would be the ad- 
vantage of getting the serial numbers, names, addresses, and telephone numbers 
of females for a given region in high-speed random access memory? 

F. Is there any advantage in having two files instead of one? Can you suggest 
improvements in the scheme outlined here? 

G. In what ways might this whole operation be simplified? 

H. How does the Amalgamated Dating operation correspond to the document 
retrieval operation described in this chapter? 


logic and probability 
"W IN SYSTEMS DIAGNOSIS 


By all means the fastest diagnostic method is a shrewd guess. If the correct 
system fault or pattern of faults can be guessed and then verified by care- 
fully selected tests, the time and cost of diagnosis can be greatly reduced. 

In a malfunctioning 100-component system, for example, if it were 
possible to guess the faulty component, pick it out from the rest, and test 
its effect upon the system as a whole by substitution of a good part (or other 
isolation and confirmation procedures as discussed in Chapter 10), we would 
require only one test— the critical verification. 


GUESSING WITH LOGICAL PROOP 

Tho skilled diagnostician in any field seems to have this skill of relevant 
selection of possibilities— the ability to guess the answer to a problem and 
then to prove that guess w correct. Unfortunately, the process of formal 
education, which leans heavily upon methodical development of a subject 
and an orderly deduction of consequences, may inhibit an intuitive flair 
or willingness to develop shrewd guesses. To guess seems not quite re- 
spectable, and the investigator may expect that an orderly progression of 
steps must be followed in a rigid chain of logic to obtain the answer to 
his problem 
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Seldom, in fact, does diagnosis proceed in this way, particularly for 
the complex system. Even astute mathematicians, according to Polya, 1 
frequently guess the answer to a complex problem and then prove it right: 
If the equation balances, no matter how the solution was obtained, at 
least one solution has been found. Indeed, many numerical methods of 
mathematical analysis operate directly on sequential guessing and verifying 
of the guess. 

Example: Suppose we do not know the square root of the number 10, 
but we do know that it is somewhere between 3 and 4. We try 3 and 
find 3* — 9 too low. Then we try 4, and find 4 2 = 16 too high. We com- 
promise and try 3.2, and find that 3.2* — 10.24 is still a bit high, and 
that 3.1* = 9.61 is a bit too low. Working back and forth in this way, 
and checking each time, we can find the square root of 10 to any 
desirable accuracy. And, the closer our initial guess is to the true 
answer, the quicker we will reach the correct result by this method. 

The same form of guessing is useful in the diagnosis of complex systems. 
One cannot help but be amused by the advice of a physician of the 1800s 
who, in his book, Practical Diagnosis * advised 

If the woman is unmarried and there is no evidence of gastric disorder in her 
tongue, it is well to remember what Battey, of Georgia, said in regard to this con- 
dition: "Always believe a young unmarried woman with abdominal tumor, of high 
social position and unimpeachable virtue, if she has been watched over by a 
platonic and abstemious young cousin of the male persuasion while the mother to 
be out, to be pregnant.” 

The test is often easy if the hypothesis is right. 

The question arises, however, as to how the investigator may come to 
make shrewd guesses. Does he pull them out of thin air? Is there a guide? 
Are there any procedures that will improve his ability to make good 
guesses? In the past, experience has been the teacher. 

The analyst of relatively similar systems develops over a period of time 
a sense of diagnostic relevance that appears uncanny to the novice. Some 
pediatricians, for example, have examined a quarter of a million newborn 
infants. A look, a movement, a sound, a suspect coloration, or an abnor- 
mality in shape or relative size are diagnostic signals, to such a physician 
and, as a pattern, suggest hypotheses of health or illness. Diagnosticians 
in the medical profession, who treat the relatively unchanging human form, 
can build a wealth of experience of this type. 3 The skilled mechanic, 

*G. Polya, Mathematics and Plausible Reasoning, 2 Vols, Princeton University Press, 
Princeton, N.J., 1954. See the preface to Vol. 1. 

» Hobart Amory Hare, Practical Diagnosis, Lea Brothers & Co., Philadelphia, 1S99, p. 21. 

* For a diagnostician's popular discussion of the ability to observe see W. Alvarez, Danger 
Signals, Wilcox and Follett, Chicago, 1953. 
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electronic maintenance man, and management consultant also develop the 
ability to define and diagnose a familiar system s problems m short order. 
For example, the partner of a major management consulting firm once 
reported to the author that his better diagnosticians could get a quick 
fix" on most corporate problems in from 4 to 24 working hours, based on 
previous experience with similar firms and problems. In short, the ability 
to make rapid appraisals of a “new” situation is a valuable asset, all too 
rare in practice, that combines a high sensitivity to critical symptoms and 
a vast experience with problems of a given kind. 

There ate, however, several reasons why the ability to use logical and 
probabilistic aids to diagnosis can enhance the skills of the expert diag- 
nostician, and contribute markedly to the diagnostic skills of the less expert 
investigator. 

Fust, training in logical and probabilistic methods, if approached in 
the nght spurt, enhances the investigator's sensitivity to critical symptoms 
and probable causes. In a sense, relating symptoms and causes become a 
habit through practice. 

Second, a formal collection of symptom and cause relationships repre- 
sents a wealth of experience for consideration. For me difficulties such 
diagnostic tables may lead to a search for relevant symptoms and causes 
that bad not been considered before. 

Third, a formal use of logical and probabilistic methods may permit the 
detection of changed diagnostic conditions, the projection of possible 
symptom-fault relationship in newly designed systems, and an increase in 
the variety of diagnostic possibilities that would not be provided by ex- 
perience itself. 

In this chapter we illustrate a few formal methods of logical search 
and probabilistic selection that are useful in systems diagnosis and problem 
solving We will Erst introduce some background material, and then apply 
the suggested techniques 


BOOLEAN ALGEBRA USiHG DESIGNATION NUMBERS 

To further our discussion, we must first descend to the realm of formal 
logic Although man) excellent works are available on logical analysis, 
our presentation will avoid much of the usual background, and will stress 
manipulations most suitable for digital computers. This approach has the 
a vantage that onlj three logical manipulations need be learned before 
P * fe“ “ be and lie notation used can 

«pWd ,Zl t t T dCliS - mUm {which evill be 

l4 dtt*’ ,ha ” b> ' m ° r ‘ English statements or 
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In what follows we will first consider conclusions that are either tru 
or false, and then conclusions that are probably true (and therefore t 
some extent probably false ). 

This distinction may be compared to the previously discussed di£ferenc< 
between a deterministic machine and a probabilistic machine. 

For the deterministic machine, a given set of input conditions (or se 
quence of input conditions) inexorably leads to fixed and given conclu- 
sions (our outcomes), which depend logically upon the input condition* 
by virtue of the machines, or logical system s, structure. 

For the probabilistic machine, a given set of input conditions may lead 
to a range of possible output conditions. When the probabilistic relation- 
ships arc known, it is usually possible to compute a probability of occur- 
rence for each possible outcome, given the input conditions, (The converse 
may also be possible; i.e, for a given output condition, it may be possible 
to determine the probability that certain input conditions lead to an ob- 
served result.) 

The usual computer program, for example, is comparable to a deter- 
ministic machine, because a given output, or set of outputs, follows logically 
from a given input, or set of inputs. The coin-changing vending machine 
of Chapter 2 is another example. 

An example of a probabilistic machine is the market-share model illus- 
trated in Chapter 3, using switching matrices. 

It is important to note that both the deterministic machine and the 
probabilisic machine have structure, but that the two forms of structure 
are portrayed differently, and lead to different results. 

In the remainder of this section we will direct our discussion to the 
deterministic case. 


Statements 

Logical manipulations consist of statements and conclusions regarding 
other statements. Thus, we may make the statement the system has symp- 
tom number 1.” an observed fact. Similarly, we may make the statement 


, number 2," also an observed fact. Such logical 


“the system could , example, make the compound 


statements may symptom number I and symptom number 

“•thfcl"bsla^we would hope ,0 deduce the truth 


or 


from uus ««uyvu..v. — - <>» 

falsity of the statement, 'the system has cause number 3. 


Symbolic Notation 


Because 
adopt a 


the repetition of long statements in English is cumbersome, we 
horthand, calling the first statement “Statement A or simply A 
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,,a catting &e second statement 'SlMMirot 8' or simply ' a ' Beca ”‘ 
each of these statements may be either true (the symptom w “ , 

or fate (the symptom was not observed), each statement has a twth value 
T or F. Similarly, each possible conclusion has a troth value, T or F. 


Truth Tables, Designation Numbers, and the Logical Basis 

A complete listing of all T and F possibito.es for a group of N statements 
may be spelled out in tabular form. The statements are listed as rows ana 
the possible combinations of T and F are listed as columns. In genera , 
sw h a table will have N rows and 2 4 ' columns, so that the two-statement 
table will have ttio rows and four columns. 


Possible truth or falsity 
Statement^ F TFT 

Statement B F F T T 


If we wish to use a mechanized form of search, a binary notation is con-* 
veruent: 0 for False and I for True, where in mechanical terms the 0 may 
correspond to “no hole" in a card and the I to "a hole." The "hole” or "no 
hole" dichotomy could, of course, also orefer to a magnetized spot (or no 
spot) on computer tape, or similar binary coding in other memory media. 
For this reason two-valued logical manipulations arc particularly suitable 
foT computd search. Thus, for convenience, tt is possible to indicate the 
possible truth or falsity patterns of two statements as 

Possible truth or falsity 

Designation No. 1: Statement A 0 10 1 

Designation Mo. 1. Statement 8 0 0 1 i 

In this display, each column again corresponds to a possible combination 
of truth and falsity of Statement A and Statement B. Although it would be 
possible to arrange these columns in any order il all we wanted was a list 
of possibilities, .t is convenient for our later use to develop a standard, or 
canonical form, of display which generates the possibilities in the exact 
order shown. Note in the last display that the first row consists of alternate 
Os and Is, and the second consists of alternate pairs of ffs and i*s. If there 
were three statements A, B, and C, we could immediately generate the 
canonical table of possibilities as 

Possible truth or falsity 

Designation No. 1: Statement A 0 1 0 I 0 1 0 1 

Designation No, 2: Statement 8 00 1 1 0QU 

Designation No. 3. Statement C 0 0 0 0 1 1 H 
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where now there are 2* or 8 columns, and all possible truth and falsity 
combinations have been enumerated, as may be verified by inspection. 

When a table of truth and falsity possibilities has been constructed in 
the canonical form, it is called a complete logical basis. In addition, for 
convenience we call the series of 0 s and Is in the row for a particular 
statement the designation number for that statement. In later examples 
we will use the designation number of a statement to perform logical 
manipulations. 4 


The Truth or Falsify of a Compound Statement 

Manipulations with two-valued logic are frequently called Boolean Algebra 
after the mathematician George Boole 5 who first popularized symbolic 
logic. In most texts, such logical manipulations are illustrated by so-called 
Venn diagrams, made popular by the mathematician John Venn. 4 In elec- 
tronic and computer work, switching circuits or other analog elements are 
often used to illustrate the physical analog of logical statements. In what 
follows we will illustrate three fundamental logical manipulations by sym- 
bolic statements, Venn diagrams, and designation numbers. Thereafter we 
omit the drawings. From the three basic operations discussed; i.e., logical 
addition, logical multiplication, and logical negation , more complicated 
compound statements can be developed. 

Logical Addition 

Many of the fundamental logical manipulations appear to be trivial. How- 
ever, in breaking up difficult compound statements, the simple operations 
of logical addition, multiplication, and negation can be used to bring 
simplicity (T or F) out of apparent chaos. We thus treat the fundamentals 
first. 

The first compound statement of interest is denoted in works of logic 
as (A -f- B), which is taken to mean “ cither A or B is true, or both are 
true." For example, to use a typical logical phrase, we may want to deter- 
mine whether the following statement is True or False: “The object observed 
is either a dog or a black object, or a black dog.” Thus, if A represents the 
observation of a dog, and B represents the observation of a black object, the 
conclusion would be true if either A were true, or B were true, or if both 
were true. 

'To conform to the notation in R. S. Ledley, Programming and Utilizing Digital Com- 
puters, McGraw-Hill, New York, 1962, Chapters 8 and 10, the designation number of 
statement A will be denoted # A, where "•#“ stands for the phrase “designation num- 

* G. Boole, The Dates of Thought, London, 1854, Reprint, Dover, New York, 1951- 

* J, Venn, The Logic of Chance, Macmillan, London, 1888. 
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This statement of logical bet may be reduced to the designation number 
approach as follows. 

Statement A: Object observed is a dog n n i 1 

Statement B; Object observed is black u u A. A . 

Conclusion: Object either a dog, or black, or a black dog 0 111 
The table may be interpreted to read that the conclusion is true in every’ 
instance but one, namely, when neither a dog nor a black object has been 
observed (column 1 of the basis). The designation number for the con- 
clusion has been obtained by logical addition , which should not be con- 
fused with binaiy addition (i.e., the designation number for the conclu- 
sion is computed by adding according to the rules of logical addition: 1 + 

1 = 1, 0 + 1 = 1, 1+0*1, 04-0 = O.) Note that the designation num- 
ber for the conclusion, as shown above, indicates the truth or falsity of the 
conclusion for every' possible combination of 0 and 1, or F and T, for the 
original statements. Thus, although the: final designation number for the 
conclusion may be lengthy, it gives the logical result for any possible con- 
figuration of the logical input. Statements A and B. 

The Venn diagram for this compound statement is given in Figure 
11-la. Boundary A includes the set of all dogs. Boundary' B includes the 
set of all black objects. Two circles overlap to represent the set of black 
dogs. The compound statement (A + B) when true, is meant to include. 
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all objects within the areas A and B. It should be clear that, if (A + B) 
is false (e.g., if the subject is a white horse), the object would fall outside 
the circles in the Venn diagram. This possibility, as indicated above, corre- 
sponds to the first column of the complete logical basis composed of the 
designation numbers for Statements A and B. 

It is helpful to realize that the use of logical addition, or the + sign 
for the or in logic comes from the analogy between the logical operation 
(A + B) and arithmetic addition. For example, 

2 dog objects + 3 black objects — 5 objects 

where "objects” may be either dogs, or black, or black dogs. In effect, this 
operation corresponds to grouping items distinguished by specific charac- 
teristics under a more general category. 

Logical Multiplication 

Another compound statement of immediate interest is denoted by (A ■ B), 
which is taken to mean “Both A and B are true only if A is true and B is 
also true/’ For example, we would observe a black dog if and only if the 
object observed were A, a dog, and B, black. Thus, both A and B must be 
true if (A • B) is to be true. This result can be obtained from the complete 
logical basis by logical multiplication, which is also a column-by-column 
operation (i,e., 1x1 = 1, 0X1=0, 1X0 = 0, 0X0 = 0) as shown 
below. 


Statement A: Object observed is a dog 0 10 1 

Statement B: Object observed is black 0 0 11 

Conclusion: Object is a black dog 0 0 0 1 

As with logical addition, the logical product has its analogy with the 
arithmetic of multiplication. The parallel is most obvious in dimensional 
analysis in which two less-specific categories are combined to create a more- 
specific category. Thus, if one man works one day, we may describe the 
total amount of effort involved (which is a combination of men and time) 
as one man-day, a logical and combination. 

Logical Negation 

We can denote the falsity of a statement by placing a bar over the state- 
ment symbol. Thus, if a dog were not observed, we could denote that fact 
by A. The corresponding designation number for the negation of A is ob- 
tained by switching all 0’s to I's and all Is to 0’s in Jhe original designation 
number for A. A similar manipulation would give B's designation number, 
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Designation No. 1: (A + B) 1110 

Designation No. 2: (A + B) 0 111 

Designation No. 3: (A + B) • (A + B) 0 1 1 0 

The interpretation of this result is that (A + E) ■ (A + B) will be true 
if either of two conditions hold. These specific conditions are those indicated 
by the statements made in columns 2 and 3 of the original complete logical 
basis for Statements A and B, which was 

Statement A: Object observed is a dog 0 10 1 

Statement B : Obj'ect observed is black 0 0 11 

Tlius, (A + E) • (A + B) will be true if we have either 

(From column 2 of the basis): the object observed was a dog and not 
black, or 

(From column 3 of the basis) : the obj'ect observed was not a dog and black, 
and false under the other two conditions. 

The relative simplicity of the designation number approach when com- 
pared to manipulation of English phrases or symbols, should now be 
evident. 

Logical Identity 

Two logical statements are identical if and only if both have the same 
designation number. This important point often permits proof that two 
complex logical statements, in fact, mean exactly the same thing. 

Example: Using the same definitions of Statement A and Statement 
B as before, show the identity of the compound statements 

(A + B) • (A + B) and (A • B) + (A * B). 

We have already obtained the designation number of the left-hand 
expression as 0 1 1 0. If we can show that the designation number of 
the right-hand expression is also 0 110, we will have proved the 
identity. In Figure ll-2a we have already found the designation num- 
bers for 

(A • B) = 0 1 0 0 
(A - B) =0 0 1 0 

which upon logical addition yield 0 110- The two compound state- 
ments are thus logically equivalent ways of saying the same thing, con- 
firming the results of the previous section. 
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Logical Implication 

than .he latte, Thu, 

a black dog certainly implies that (a) a dog was seen, and ( ) 

* T«r!orTo'gica\ implication can ate be made using the Ration 
number approach and the following rule: The truth $J,”Z 

the truth of element Y if, and only if. the Destgnat.cn No. Jhm t unto 
(l’s) in at least those positions t there 1 s appear tn Designation No. X* 
Using the suggested example, we have 

(A • B)i Obiect was a dog and blaeh 0 0 0 1 (Designation No. X) 

A. Object seen was a dog 0 10 1 (Designation No. Y) 


Because Designation *Y has l's in at feast those positions that appear in 
Designation itX, the first statement's truth implies the truth of the second. 
We may then say that X implies Y, or X-.Y, where the arrow indicates 
logical implication. . 

To summarize what we have covered thus far, we inject a commercial 
example requiring logical analysis by designation numbers. 

Example; The following letter was written on September 1, 1965, by 
Joseph B. Jones, inventor, 4672 Jones Street, Jonesville, Michigan, and 
received on September 3, 1965, by Atlas Motors, Friendsville, Mississippi: 


Gentlemen: 

On August 3, 1965, 1 ordered your Number A463, i horsepower motors 
<§, $27 ea , delivered You assured me in confirmation that I would receive 
the motors in ten days if my credit was good, or in 48 hours if l sent cash. 

In addition, you indicated that there could be a delay if there were a 
Teamsters strike at your plant, or if a Civil Rights demonstration inter- 
fered with business Because my operation does not have a Dun & Brad- 
street rating, l immediately sent cash, which you acknowledged. I have 
heard that there were neither recent demonstrations, nor a Teamsters strike 
in FriendsvQle. So f am at a loss to explain why I do not have your ship- 
ment in hand. Flease advise at once. 

Very truly yours, 
Joseph B, Jones 

Jones got his cash back, due to an unexpected inventory shortage— -a 
possibility nos mentioned in the letter. But, from the available facts, 
how can sse use designation numbers to come to the same conclusion 
JU ^5 nei l e ' en though the conclusion is obvious from reading the letter)? 

The company made four statements (regarding credit, cash, a strike, 
ami a demonstration > Each of these “questions" could be answered yes 
or no, so that we can make up a table of statements and possible out- 
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comes, giving four designation numbers, one for each statement (Yes 


Statement Possible outcomes 


Credit Good 


0 10 1 

0 10 1 

0 10 1 

0 10 1 

Cash Sent 


0 0 11 

0 0 11 

0 0 11 

0 0 11 

Strike Called 

#C 

oooo 

1111 

oooo 

1111 

Demonstration 


oooo 

oooo 

1111 

1111 


Atlas Motors claimed the motors would be sent if credit were good, 
or if cash were sent, and if neither a strike nor a demonstration inter- 
fered with business. In shorthand form, that would be (A + B) • 
( C * D) = Y, where Y is shipment of the goods. 

Following the rules for designation numbers, we can see if what 
Jones did implied that shipment would be made. First we find #Y, the 
shipment designation number. 

Adding the designation numbers for (#A + #B) we obtain 
(#A + #B)=0111 0111 0111 0111 

And for • #£5) we obtain, remembering that negation reverses 
the 0's and l’s in the original designation numbers C and D 

■ # 0 ) =1 1 1 1 0000 0000 0 000 

And finally: 

= 0111 oooo oooo 0000 = #r 

expressing the requirements for shipment. We now find the input condi- 
tions X. 

The facts were that Jones sent cash, because he had no credit, and that 
no strike or demonstration took place. Thus, X = (A • B) • (C • D ). 
Following the procedure above, we may find the designation number for 

= 0010 0000 0000 0000 = #X, 

the designation number representing the conditions of the Jones trans- 
action. 

Because there are at least as many Is in the same position in #Y as 
there are in X -> Y, and the shipment should have been made. This Is 
an example of deducing the logical consequences of a set of facts and a 
series of logical statements using the designation number approach. From 
an input-output viewpoint, X represents the input to the system ( the Atlas 
Motors decision-making and order-handling operation), Y represents the 
output “shipment.” 
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Boolean Functions and English Statements 
from Designation Numbers 


The analyst may have arrived at a designation number that he wants to 
put into symbolic form (i.e., into a Boolean Function) or to convert into 
English. As we have seen, there are usually a number of equivalent com- 
pound statements. One problem that arises in equipment design is to find 
the simplest equivalent statement in terms of physical construction of a 
logical device. Although we will not pursue that route here, it is convenient 
to have one way of obtaining an equivalent symbolic or English statement. 
We will indicate one method here, and leave other methods to the suggested 
references and problems at the end of this chapter. 

Using what is known as the first canonical form, we may convert any 
designation number into an equivalent symbolic statement by noting from 
Figure ll-2a that the designation numbers for all possible products of two 
statements A and B are 


#(A.ii)=100 0 
Tr( A ■ 5) =0100 
#(X- B) =0 0 1 0 
#(A- B) = 0 0 0 1 
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situation. However, things are not nearly as bad as they seem when we 
remember that many combinations, which are logically possible, are not 
probable, or necessary, or feasible in practice. These constraints eliminate 
many columns from the complete logical basis, and produce a much 
smaller constrained logical basis, which may be handled as before. 

Thus, the logical distinction between a number of statement combina- 
tions may be immaterial if the presence of one true statement in the 
combination is considered to take precedence over the others, as in decision 
trees and sequential testing. A number of logically possible combinations 
may technically not be feasible, because they cannot be constructed or 
because a technological sequence of assembly or operation limits variety. 
Or, returning to the diagnostic problem, certain possible symptoms and 
causes may have never been observed in combination, so that the available 
data are constrained by experience or system knowledge. Indeed, many 
of the simplification techniques of Chapter 7 and 8 may be employed to 
reduce a complete logical basis to a constrained logical basis for applica- 
tion in a given case. An example will illustrate the point, and bring us back 
to our problem of logical diagnosis. 

Example: Suppose an order-processing department decides to make 
shipment of a product on the basis of three statements. A, B, and C. The 
complete logical basis for these statements would have three rows and 
eight columns, as shown below. 


Statement A: Credit Limit OK 
Statement B : Pay Experience Favorable 
Statement C: Special Clearance Obtained 


Shipment to be Made 


or 


Column No. 
12345678 

0 10 10 10 1 
0 0 1 1 0 0 1 1 
00001111 

NYYYYYYY 
01111111 
J J J / 


Suppose, in addition, that a shipment will be made if the credit limit is 
OK, or if the pay experience is favorable, or if special clearance has been 
obtained, or for any combination of these possibilities. The resulting action, 
is shown in the last row in decision-table form. This complete basis and 
the associated rules for shipment suggest immediate simplification, because 
if Statement A is true, the truth or falsity of Statements B and/ or C is im- 
material, and similarly, if A is false and B is true, then the truth or falsity 
of C is immaterial The columns so affected are indicated in the table above, 
and are columns 4, 6, 7, and 8. With these columns deleted and the columns 
rearranged in the order 2, 3, 5, 1, we have 3 new constrained basis in which, 
for example, columns 4, 6, and 8 have been combined with column 2, and 
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column 7 has bean combined svith column *■ to B* «* ^ulatton, the 
irrelevant table entries are indicated by a dash 

Old Column No. 


A: Credit OK. 

B; Pay Experience Favorable 
C: Special Clearance Obtained 
Shipment to be Made 
or 


2 3 5 1 
10 0 0 
-10 0 
- - 1 0 
V V V N 
1110 


In this MM derision-table format, all the tfs and l’t are relevant to the 
decision, but the ( - )’s are not. This new arrangement svould seem to pose 
special problems in view of the manipulations just discussed, became the 
(— )'s are ambiguous, and the standard order of the table columns has not 
been preserved. However, in fact, no great difficulty arises if we remember 
the rules for logical implication. 

To determine whether or not shipment should be made in a given case, 
let us renumber the columns in the decision tabic 1, 2, 3, 4, with the 
understanding that this definition will refer to the last table shown above. 
With this modification we may next call the four-place designation number 
for the shipment conditions #Y = 1 110, because %ve have a yes at the 
foot of the first three columns and a no at the foot of the fourth. We may 
also indicate the designation number of any incoming order in four 
columns. For example, if we take the designation number of statement A, 
as shown in the four-column table, as #A ~ 1 0 0 0, we see that A Y, 
because #Y has I’s m at least those positions where Is appear in #A. 
Again, suppose the new order exceeds the credit limit, but has favorable 
pay experience and also has obtained special clearance. As a Boolean 
(unction, this state of afiaire could be indicated by (A • B * C). How can 
this statement produce a designation number? If tee take the (— )'s in the 
table to be 0’s, we may proceed by logical multiplication and find 
#(A • B ’ C) = 0 0 0 0. remembering that the four-column designation 
number results from the constrained basis defined above. Again, because 
£>‘ has l's m at least those positions where - B • C( has Vs, (A-B- 
C } —* ( 1 ), and shipment should be made. Finally, we may ask what 
happens whew none of the tests can be passed by an incoming order, namely, 
an order having the condition (A • S ■ C). Agam assuming that {-Vs are 
0s, we find 
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#4 = 0 1 1 1 
#£=1011 
#C = 110 1 
#(£ * B-C) = 0 0 0 1 

and the shipment is not made, because #1’ does not now have I’s in at 
least those positions shown in #(A • £ ■ TT). 

Using designation numbers in this way reduces a series of tests to a 
simple matching operation. 


ALTERNATE SCREENING METHODS USING MASKING MATRICES 


We may also use an alternative procedure, which has computational ad- 
vantages in some cases. Using the same credit example, suppose we have 
the original table of statements and shipment conditions, and wish to have 
an automatic way of eliminating or “mashing” the (— )‘s, indicating "don't 
care" conditions entered in the decision table as 0’s. We may accomplish 
this by a masking matrix, or a table that has Is in the relevant test posi- 
tions and 0’s for the (-)'s, as shown below. 


Converted 
decision table 

Credit OK 10 0 0 

Pay Favorable 0 10 0 

Special Clearance 0 0 10 


Masking matrix 
1111 
0 111 
0 0 11 


Test 

data input 
0 
1 
1 


Because in this example shipment is made when any of the three condi- 
tions, represented by columns 1, 2, or 3 of the decision table, is met, a 
direct examination of the pertinent test conditions suggests itself. That is, 
because shipment depends upon logical addition of alternatives we can 
test input cases against successive columns of the decision table, and 
omit row manipulations. Some reflection will reveal that a column-match- 
ing approach may be efficient for two reasons. Because even the con- 
strained basis is likely to have more columns than rows, each test will 
probably involve fewer characters than the complete designation number 
of shipment conditions. And, if the column matching stops when the first 
suitable acceptance condition is met, we can shorten the column testing 
procedure by arranging the decision table to place the highly probable 
matches first. 

To illustrate the masking procedure, suppose that the test data input 
shown above in column form has been received, i.e., the order exceeds 
the credit limit, but has both favorabJe pay experience and special clear- 
ance. This data input does not match antj of the columns in the converted 
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Input against the ft* «ta» - Mto 

decision table, tie mastag »«<*; approach jails for 1 

logical multiplication of the data input by the ^column eWn 
the masking matrix, to produce the computed result shown. 


First column of 

Data input the masking matrix 


X 

X 

X 


Computed 

result 

0 

0 

0 


First column 
of decision table 
1 
0 
0 


This computed result is compared element by element to the first column 
of the decision table. Because there is not a perfect element-to-clement 
match, the test fails, and we proceed to column 1 . 


Second column of Computed 

Data input the masking matrix result 

OX 1 = 0 

IX 1 = i 

IX 0 = 0 


Second column 
of decision table 
0 
1 
0 


At the second column of the masking matrix, a perfect match is found 
between the computed result and the second column of the decision table, 
so that the testing stops, shipment is made, and the testing system is 
already to receive the next input. Note that in this procedure we are deal- 
ing with repeated manipulations with three binary digits, rather than with 
four as m the previous method. For most decision tables constructed from 
a constrained logical basis the character difference would be greater. Both 
methods are entirely equivalent logically, and the choice is a matter of 
convenience m a given case.* 


SYMPTOM-CAUSE COMPLEX TABLES 

We now tetssm to a slagnostic problem. Suppose Oat see decide to develop 
* diagnostic table. Figure U-3, that uni! relate all possible combinations 
oi too symptoms and two tames oi system failure. Although it may appear 
tutierent, this table ss identical with the complete logical basis discussed 
ptessously, because we will consider 08 symptom-cause coropiejses, repre- 
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senting all combinations of symptoms and causes present or absent. For 
this reason, the logical statements comprising the rows of the logical basis 
will consist of a list of possible symptoms plus a list of possible causes. Al- 
though for the two-symptom, two-cause problem, we would have four rows, 
for n symptoms and m causes there would be n + m rows and 2" +m columns 
in such a table of theoretical possibilities. Figure 11-3 shows such a com- 
plete logical basis for the two-symptom, two-cause problem. 

It is important to understand clearly that each column represents a 
symptom-cause complex that conceivably could occur. For example, the 
first column indicates the case when neither S(l) nor S(2) is observed in 
the absence of both causes Ofl) and D(2). Column 12 indicates that both 
symptoms S(l) and S(2) may be observed when the system has cause 
D(2) but not D(l). Column 16 indicates that both S(l) and S(2) may be 
observed when the system has D(l) and D(2) in combination, and so on. 
The sixteen columns cover all the logical possibilities. 

The table shown in Figure 11*3 is of substantial size for such a simple 
problem. Moreover, if the investigator had such a table before him, it 
would be worthless. Regardless of the combination of observed symptoms. 
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11-3 The canonical form of the symptom-cause complex table. The combination of 
the two symptoms, S(1) and S(2), produces a set of four possible symptom complexes, 
s', with / a= 0. 1. 2, 3, . . . shown In the second row of the table. The combination 
of’the two causes 0(1) and 0(2} also produces four cause complexes d, with I = 0, 1, 
2. 3, as shown by the four blocks at the foot of the table. The four symptom complexes 
and the lour cause complexes in combination produce 16 symptom-cause com- 
plexes, Ci, enumerated in the last row of the above table, by indexing each symptom 
complex within each cause complex from left to right. It is Important to remember in 
what follows that an upper case S or D refers to a single symptom or cause and that 
a lower case s or d refers to a “complex," or combination of Individual symptoms or 
causes, present or not. 
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the investigator could not diSerentiate, from the table, whether the syrop- 
tom(s) resulted from Came ( l), or Came (2), neither, or both. 

An important point to he lcanred from this simple exercise fe that 
kmledge imposes constiaints upon possibilities. !f al the possibilities 
we feasible, and if all the possibilities were equally hfcely, there would 
be no need to mate observations. A Sip of the coin would do in making the 


diagnosis. , < 

But experience, if it is organized at all, is never completely random, by 
definition. There are some symptom-cause complexes (columns in the Table 
of Figure 11-3) that just do not occur, and there are some that do occur 
in certain patterns. The knowledge of these specific constraints as they 
logically occur thus becomes a large part of diagnostic training, or expert* 
enoe. In short, education and training in diagnostic ability, as in other 
skills, assumes considerable system constraints. Learning also requires 
that such constraint will be relatively stable over time, or that possible 
changes in constraints can be predicted in advance. 

To continue with our example, let us assume that only the symptom- 
cause complexes of columns 6, 7, 8, 11, 12, 15, and 16 (in Figure 11-3) 
have been observed or -.itc feasible. With this knowledge we may delete 
all other columns of the complete basis and develop the constrained logical 
basis shown in Figure 11-4, consisting only of seven feasible or observed 
columns, representing symptom-cause complexes. 

With this table of constrained possibilities in mind, we may now receive 
a set of observations regarding S(l) and S(2), and seek to make a differen- 
tial diagnosis regarding D(1 ) and D(2). 
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For example, suppose the diagnostician receives a report or observes 
that the system has symptom S(l), but not symptom S( 2); i.e., the input 
to the logical search is S(l) • SfZ). What can be deduced from this 
observation and the available logical data? 

Using the constrained basis of Figure 11-4, and the corresponding 
designation numbers for S(l) and S(2), we compute the designation num- 
ber for S(1 ) * S(2), we compute the designation number for S(l) • S(2) 
to be 

#S(1) = 10 10 10 1 

# 5^3 =1000000 

#(S(1) -ST5)) = 1 0 0 0 0 0 0 

This result and the rule for logical implication tell the investigator that 
the system has cause D( 1) and not cause D{2), and not the combination 
of causes 0(1 ) and 0(2). In other words, (in this case) a unique diagnosis 
of a single cause can be made from the set of two symptoms. 


logical diagnosis from symptom-cause tables 

To illustrate this conclusion, we may approach the data in two ways: By 
row or by column. 

First, the row designation number of S(l) • ’5(2} = 1000000 has 
a “1” only in the first column, which leads us directly to the first column 
of the constrained basis and the symptom-cause complex involving 
S(l) • S(2) and P(l) • P (2) . Thus, we have a unique and immediate 
diagnosis: D(l) • D(2). 

Second, we could have scanned the symptom-complex rotes column-by- 
column [checking S(l) ■ S(2) for a match] and again found that only 
column 1 matches the stated conditions. The corresponding cause complex 
in column 1, D(l) • 2T(27, is the unique diagnosis for the stated symptoms. 
(As indicated in the previous credit example, the methods are equivalent. 
A masking matrix is obviously not needed here, because none of the symp- 
tom conditions are irrelevant to the diagnosis. ) 

It is important for what follows to note that if the observation of symp- 
toms had revealed either S(l) or S(2) alone without a statement about the 
other, or if the observation had revealed the combination S(l) • S( 2), a 
unique diagnosis could not have been achieved. 

To spell this out, if the observation had revealed S(l) only, without 
any statement about S(2) being definitely true or false, a scan of the symp- 
tom-cause complexes or logical manipulation, as indicated above, would 
have indicated that the system could have had cause D(l), or D(2), or 
both D(l) and D(2) in combination. A similar result would have been 
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U. ■ .j -r vm hua been observed wilboul any information about S(l). 

the would have been obtained from the given cm- 

^ ' f' , . . ci-i) and SfOt had definitely been observed jn combina 

r U ^ “) not w aided in the difeen- 

BaJ diagnosis of D(l) versus 0(2) or 0(1) • D (.)■ 

ACTIONS WHEN DIFFERENTIAL DIAGNOSIS IS NOT UNIQUE 

From what has just been said, it should be clear Hurt tbe observation of 
a given symptom complex may not lead to a unique diagnosis and tfiat, 
in fact, the symptoms observed may or may not narrow the field of choice- 
Tbs situation, of course, is particularly evident when several cause com- 
plexes have a large number of similar symptoms- 

Two routes are open to the investigator when the results of a prelimi- 
nary diagnosis are ambiguous: (1) He may collect more symptoms, he-, 
make additional tests and observations which, hopefully, will farther 
narrow the field, or (2} he may resort to a probabilistic choice, and pre- 
scribe. treatment on that basis. We will discuss each of these approaches in 
turn. 


Wore Testa From Symptom-Cause Tables 

First, note that it would be impractical, became of tabular size, to include 
m one array all the possible symptom-cause complexes for a system of 
any great sophistication. Fortunately, such grand arrays are not necessary, 
because clustering i-Bects tn symptom -cause-complex data allow' large tables 
to be broken up into smaller ones for specialized use. Rudimentary screen- 
ing of a few symptoms would usually narrow the field sufficiently to 
indicate which special, smaller table should be consulted for more detailed 
search. Moreover, usually only detailed diSerentiation of causes presents 
any great diagnostic difficulty. 

Second, clearly the collection of some symptoms, or the application of 
some tests (which is the same thing), is easier than others. For example, 
an interview of the patient is easier than a physical examination, for both 
the patient and the physician, and both of these symptom-gathering 
methods are easier than laboratory tests or exploratory surgery. Similarly, 
for hardware systems, outside tests are usually easier to perform than inside 
tests, so that the investigator would usually prefer to take the easier 
outside alternatives first. Similar remarks could be made about organiza- 
and systems investigations generally. In short, the 
* , lga °5 j* 1 UJ uaIIy prefer to collect symptoms tn a certain order, 

a n rrr,W, by °* a PP 1lcalion ' tlme > linger to the system, 

and to select within this framewo** ««*«* -u . 3 

In practice, the investigator i 


s diagnostic tables not only to lead him 
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to a unique diagnosis, but to do so in an effective manner by indicating 
symptoms or tests critical to the differential diagnosis for each “stage” of 
cost, ease, or time criteria. 

To illustrate how this can be accomplished, we must resort to a more 
extensive symptom-cause-complex table, such as the one shown in Figure 
11-5. This table consists of 16 symptom-cause complexes (as columns) and 
an array of 14 symptom possibilities and eight cause possibilities (as rows). 
For illustration, we will assume a medical example in which the system is 
a doctor s patient, and the cause of the possible symptoms are diseases. 
Thus, the table of Figure 11-5 may be thought of as a tabulation of symp- 
tom-disease complexes, and the constraints exhibited in the table's con- 
figuration representative of medical knowledge about the eight diseases 
and their associate symptoms. 

Using such a table, in which the diagnostic tests have been ranked in 
the order of their difficulty, we may see how the investigation would 
proceed. 

First, the physician examines the patient's history and interviews him. 
Observation of the first three lines of Figure 11-5 indicates that the collec- 
tion of S(l) provides no information useful to the diagnosis, because eight 
diseases show that symptom. The physician thus looks for S(2) and S(3), 
which will do some good. 

Suppose the patient has S(2) and S(3) in combination. This fact throws 
out symptom-disease complex columns 12-16, and thereby narrows the 
field initially. 

Next the physician proceeds with the physical examination. Of the 
possible symptoms S(4), S(5), and S(6), only two are necessary, because 
three will do no better at differentiation than two, as may be seen by scan- 
ning rows 4, 5, and 6 of Figure 11-5. Suppose, after physical examination, 
the patient has S(4) and not S( 5). Because only columns 3-9 pertain to 
this combination (by a scan of rows 4 and 5), possible symptom-disease 
complexes, 1, 2, 10, and 11 may now be eliminated. 

The choice of the blood tests S( 7) through S(12) is more difficult. As 
a general rule at this stage, assuming all the tests were equally difficult, 
the physician would look for the smallest number of tests that would pro- 
vide a unique diagnosis, if this were possible. (If not, he would look for 
tests that would most reduce the possibilities. ) In this case, there are 
seven symptom-disease possibilities. If each blood test is reported positive 
or negative, we may observe ( from our binary guessing game of Chapter 5, 
page 119) that the smallest number of tests that could discriminate between 
seven alternatives is the nearest integer greater than log 2 7, or 3. (2 s is eight.) 

It might take more tests than this, but three tests are the minimum. Thus, the 
physician would look for a combination of the three blood tests that will 
accomplish this discrimination. The candidates for this selection, again 
remembering the binary guessing game, would be tests whose rows showed 
an approximately equal number of 0's and l’s. A scan of the possibilities 
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Column numbers: 12 3 4 
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11-5 A aymptom-cause-complex table consisting of 14 symptoms and 8 causes. This 
constrained logical basis shows testa arranged t* the order of progressive difficulty. 
A medical illustration is used, although the same procedure works for other systems. 
Alter each set of diagnostic te3ts or symptoms collected, the possible diagnoses are 
reviewed to determine what tests to do next. {From Programing and Utilizing Digital 
Comm by B S ts*«y Copyright 1862 by McGraw-Hill Book Company. Used by 
permission, page 341.) 


indicate! that S(9), S{10}, and S(12) have this desirable distribution, so 
they are selected- As a check for unique discrimination, we compare the 
three candidates, as below, i n which the column of Figure 11-5 is indicated, 
Test Column 

3 A S ft 7 8 9 14 

s (9) i i TTriTo 0 

S(I0) 1 101Q00 1 
S(12) 1 0 1 1 o 0 l 0 
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Note in the set of numbers above that a]] columns are unique, and that 
the columns shown represent an exhaustive list of all the 2 3 possibilities 
for three statements. The possibility of obtaining the test result shown in 
column 14 has previously been eliminated by the physical examination, so 
that the application of the three blood tests in this case will provide a unique 
differentiation between the symptom disease complexes numbered 3-9. 

Continuing, suppose the laboratory report shows that the patient is 
positive on S(9) and S(12) and negative on S(10); i.e., the patient has 
symptoms 5(9) and 5(12), but not 5(10). This result then points to the 
symptom-disease complex of column 5, which is a unique diagnosis. Finally, 
reference to the disease complex of column 5 (at the bottom of Figure 
11-5) shows that the patient has both D(4) and D(5), but none of the 
other diseases. The bone marrow test is not needed. 

This selection sequence shows how the investigator can use diagnostic 
tables of the sort shown to guide his selection of tests to isolate a complex 
of diseases. As a further example, it is interesting to note that if (instead 
of the sequence of events described above) the patient had shown both 
S(4) and S(S) upon physical examination, the investigator could immedi- 
ately narrow his diagnosis to the S}7nptom-disease complexes of columns 1 
and 2 in Figure 11-5. However, here the blood tests would have been 
inadequate because they could not differentiate between the two possibili- 
ties. Both bone marrow tests S(13) and S(14) would have been required 
to come to a final and unique diagnosis. 

The Use of Probability Tables In Diagnosis 

By necessity or by choice, the logical diagnostic sequence described above 
may not come to a unique conclusion. 

For example, because of cost, time, or physical restrictions, further 
diagnostic tests upon a system may be impossible, uneconomic, or un- 
necessary. 

First, it is often possible to strike an economic balance between the 
cost of further diagnostic tests and the cost of alternate treatments. A more 
expensive “gross” treatment may be preferred to a continuation of further 
tests, say, when a module in a system can be replaced at less cost than a 
component within that module plus the cost of finding the faulty component 
A similar situation occurs for the physician when a more expensive “broad 
spectrum” pharmaceutical will treat a block of undifferentiated ailments, 
within which further diagnosis might isolate the particular offending cause 
complex, but only at further expense, time, and difficulty. Here the possi- 
bility of module or block treatment suggests that diagnosis stop when the 
required block of cause complexes has been isolated. 

Second, either because the complete range of diagnostic tests required 
for logical isolation is not known, or because time and costs require it, 
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diaanosis may again terminate u/ien block treatment is net possible. In iKs 
iiSfc treatment may In required even though complete 
differentiation of possible cause complexes is not possible. ^ 

In the latter case, a probabilistic diagnosis is often made in an iterative 
procedure. A tentative diagnosis is made, based upon choice of the most 
probable diagnosis. A suitable treatment is used to remove the hypothe- 
sized cause complex. If the treatment proves satisfactory, the iteration 
steps. Should the treatment fail to remove the symptom complex exhibited 
by the system, that failure is used as an additional test result, the prob- 
abilities of alternate diagnostic possibilities are re-evaluated, and the cycle 
repeats itself until (a) the system difficulty is removed, (b) all possible 
treatments, or all blown alternative diagnostic possibilities have been ex- 
hausted, or (c) the system “expires” or shows new diagnostic symptoms. 

( For example, the stress methods discussed in Chapter 10 are often used 
in hardware systems to produce symptoms that are more easily diagnosed.) 

It is important to understand that the probabilistic evaluations made at 
each stage of this dynamic diagnosis-treatment iteration will not in ony icesy 
necessarily indicate a trend toward the final diagnosis. 

“For example, consider a patient with a disease for which there is a 
specific test. Until that test is made, the probability of the patient’s having 
the disease might actually grow smaller and smaller with each iteration, 
but after the test is made, this probability will rise to unity! On the other 
hand, when it becomes infeasible to maVe additional tests, the probabilities 
must play an important role in the choice of treatment” * 

It is now beneficial once again to detour slightly from our main theme 
and review the construction of probability tables that may be used in the 
diagnostic-treatment iteration. 

Returning to the logical table of Figure 11-4, suppose that the patient 
had symptoms S{\) and SI, 2) in combination, so that he may have any of 
tlw three symptom-cause complexes CJ, C*. or CJ. To make the probabilistic 
diagnosis, suppose that a historical record o! case histories indicated a dis- 
tnbmion of results shown in F.prre II*. This record relates the observed 
distribution of cases to the constrained logical basis used previously. 

For probabilistic computation, it is convenient to rearrange these data 
mto the two-way frequency table shown in Figure ll-6b, which is a stats- 

nl ™ ", S ' aHsUc3 ' f plays " Elch » *0 body of Figure 

ll-6b con-esponds to one of she 16 possible symptom-cause complex col- 

th "w '"t ?’ mp r, , e , 'HP™' W ot F 'S ure 11-3. The nonzero cutties hr 
the body oi Figure ll-6b correspond to the seven symptom-couse complex 

* L«5Vy. op cU . p 342 

u Jfwmv of McCraw- 

.. stacied judrm, Jnd «d, McC»w-IBn, 
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11»6 Experimental development of diagnostic probability tables. As an example, it is 
assumed 2400 cases have been observed from the same population. The number of 
cases observed for each symptom-cause complex Is shown, (a) The constrained logi- 
cal basis showing the number of cases observed for each symptom-cause complex. 
(b) An alternate display for the data of (a). 


Columns of the constrained logical basis of Figure 11-4. (It is from this form 
of historical record that the constrained logical basis may be constructed 
from the complete logical basis.) The row and column totals for each 
symptom complex and each cause complex are shown in Figure ll-6b. 

To convert the observed frequencies to probabilities, we now divide 
all entries in Figure 11 -6b by the total number of cases observed, 2400. 
This produces the result shown in Figure ll-7a. The resulting probabilities 
in the body of Figure ll-7a are the probabilities that given symptom -cause 
complexes, C*» have been observed. The probabilities obtained from the 
row and column totals, often called the marginal probabilities, represent 
the probability that a given cause complex has been observed, i,e., P(d /), 
and that a given symptom complex has been observed, i.e., F(s i ). For 
reference the symptom-cause complex probabilities, P(C *), are shown in 
Figure ll-7b in the form of the constrained logical basis. 

Now, using either Figure ll-6b or ll-7a we can compute the condi- 
tional probability that a particular symptom complex will be observed in 
the presence of a given (or known) cause complex, i.e., P(s‘\dj) as shown 
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il-7 Computation of probabilities from observations, (a) Conversion of observations 
to probabilities When ell the celts to Fig. ll*€b era divided by the total number of 
observations <2< 00), (a) results The body of this table shows the values of P{CJ>, the 
probability that a given symptom-cause complex will appear. the probability of 
observing a given symptom-complex, and P(d,). the probability that a given cause 
complex v*m be observed, ate shown In the outer row and column, respectively, (b) Art 
alternate display of the probabilities In the body Of (aj showing values of A{C|} as- 
sociated with each column ol the constrained logical basis. The same result would 
have been obtained from Fig 1 1-6a by dividing the last row of that figure by 2400. 


m Figure U-8ft, by dividing each row in Figure ll-6b or ll-7a by tbe row 
total. Similarly, we can compute the conditional probability that a particular 
cause complex will be observed in the presence of a green (or known) 
symptom complex, i.e., P(d,ls‘) as shown in Figure ll-Sb, by dividing each 
column in Figure ll-6b or ll-7a by the column total. 

As an illustration of the use of these tabulations, consider the problem 
of the patient with S( 1 ) and S(2), which corresponds to the set of symptom* 
cause complexes defined as C», where the values of ( range over those 
disease complexes associated with symptom complex i 3 in the logical basis. 
Column 3 of Figure ll-Sb gives the values of Pfdjjs 3 ) and from this tabula- 
tion we see that P(d^) = *, P(d,Ja*) = an d P(d,|s*) In this 
problem it is equally likely that the patient has D(l) *75T£> or 
B(V> • DU), and less likely that he has the combination D{X) • D(2>, 
corresponding to d,. Although, from the available data, a single choice is not 
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11-8 Conditions! probabilities obtained from Figure 11-6 (b). (a) Values of the 
conditional probabilities P(s' J d,). (Note that all rows of the body of this table add to 
1 . 0 .) Division of each row by its row tota! produces the conditional probability P(s‘ ) d,) 
in the body of the table. P(s‘ j d ,) is the probability that symptom complex / will be 
observed, given the truth of d,. P(s'} also results from this computation, as shown, 
(b) Values of the conditional probabilities P{d, )s'). (Note that all columns of the body 
of this table add to 1 . 0 .) Division of each column by its column total produces the 
conditional probability P[d t j s') in the body of the table. P(d, J s') is the probability that 
cause complex / is present, given the observation of symptom complex /. P(d,) also 
results from this computation. 


indicated by the probability figures, it is clear from the data that the chance 
of having both D( 1) and D( 2) is less likely than having a single disease. 

It is worthwhile to note that the above conditional probability figures 
could have been obtained directly from the data of Figure II-7a as follows: 

P(Cj) & 2 _ 2 

P(di |s3) “ P(C?) 4- P(C\) + P(Cl) A + A + A 343+2 8 

and so on for the other values in column 3 of Figure ll-S(b). 

Also, because the fraction -<h is common to both the numerator and 
denominator of the ratio, it may be eliminated for simplicity. We are only 
interested in proportions. Thus, 
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p ( d 'W “ F+T+2 ~ s 

?(*!«*) - 3 4- 3 + 2 “ g 
f " 3 + 3 + 2 “ I 


Total « | « 1.0 

(Note that the conditional probability values obtained are larger than the 
symptom-complex probabilities C\, C£, and C*.) 

Finally, it vriB be useful for what follows to introduce another way to 
compute conditional probabilities, namely, by Bayes' Formula* 1 which for 
is 




P(WJ 
£ pyjWi)’ 

i 


where the sum ia the denominator includes all disease complexes associated 
with symptom complex a 3 . The denominator of Bayes’ Formula is a normal- 
izing factor to make the sum of all conditional probabilities involving the 
d,‘s (given s 3 } add to 1.0. If it were only necessary to rank the d/s in order 
oi probability of occurrence, we could let the denominator he any convenient 
number. To illustrate, we will compute the denominator of Bayes’ Formula 
forPldijs*) as 


d ~ pwofw,) + p(d,)p(*w + m)?w) 

- + (i)(§) + «xi) - o). 

Using this same denominator repeatedly, we find the values in column 3 
oi Figure 11 -8b 


, WW _ «>(!) 3 

<!) “ «) “5 

P {A | rt _ IWW «)(>.) 3 

tt> IT”? 

pu.m - fWf W*) _ (»«) 2 

«> ~<r ” s 
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From these values it is clear that the ranh order of the values P(rfj|s») may 
be obtained simply from the numerator if there is no need to normalize. In 
general, all of the values in Figure II-S(b) could have been obtained in 
this way, because 


w - 


P(J,)P(»*|d,) 

£ P(i)P(s-|A> ' 


In an exactly analogous way it would be possible to compute the con- 
ditional probability of finding a particular symptom complex with a given 
cause complex by using the Bayes’ Formula 


P(<W - 


F(«’)p(d < lf) 

T, P(s‘)P(d,|s‘) ' 


For example, remembering that the denominator of the expression covers 
only the cause complexes associated with the given symptom complexes, 
we have for cause complex 3: 


5 P(s‘)P(d,|>*) = (A) Mr) + <*)(« = }-•£>. 

so that 


PW,) 


(tf)W _ 2 

(1) 3 

(A)(1) _ 1 

3 


corresponding to the two values shown in row 3 of Figure ll-8a. 

(The computations just made using Bayes’ Formula may be verified by 
inspection of Figures ll-8a and b. The formula follows directly from the 
construction of these tables.) 

The reader may wonder why the algebraically more complicated Bayes’ 
calculation has been introduced when easier row and column manipula- 
tions can be used to obtain Figure ll-8a and b. The answer will become 
evident when we consider diagnostic applications of the tables. 

Suppose we could report diagnostic data in the form of Table ll-8a or 
b, which is of course true. Which display is more desirable? 

At first we would say that Figure ll-8b is better because it gives the 
answer we want in a particular diagnosis, namely, the probability that the 
system has a given cause complex, for observation of a given symptom 
complex. For example, for any observed symptom complex, a column in 
Figure ll-8b may be selected and the resulting diagnostic probabilities 
are given directly. 
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VJniBttmtdy, however, the conditional probability Pf^') is **S«* 
to the characteristics of the particular population used to compile the 
statistics A small epidemic of causes of a particular type can greatly distort 
*e film's for a tL. Systems employed or Thong" in one on— n 
could not be expected necessarily to produce the same probabilities as 
those in another. la other words, the data of Figure 11-Sb might well be 
unstable from time to time, and from place to place. 

On the other hand, the tahle of Figure ll*8a, which gives the probability 
that given symptom complexes will appear if it is known that the system 
has cause complex d h is less likely to change with time: The symptom 
complexes arise from a given cause complex, not vice versa. In medical 
terminology, the etiology of the symptoms is related to or stems from the 
disease. It is for this reason that medical texts discuss P(s‘jd/) rather than 
Pfd/js*).” The same argument holds for other trouble-shooting problems. 
Although we want to find P{d,jj'), the constant relationship between 
symptom complex and cause complex is given by P(s‘{d/}, The required 
conversion may be obtained from Bayes’ Formula. 

The numerator of the Bayes’ Formula P(d/)P(3*}d/) contains two parts. 
The first term P(d f ) is the probability that a given cause complex will be 
observed, and is a characteristic of a given population at a given time. For 
example, the physician may know that there is flu about at a given time 
in bis community, and thus, be will heavily weight the possibility of flu 
in a list of possible diagnoses when a patient calls up complaining of 
lassitude, mild fever, stomach cramps, headache, diarrhea, and nasal con- 
gestion (although other cause complexes could lead to the same symptom 
complex). The second term, P(a‘|d,), essentially independent of the current 
patient population, provides the second component of the diagnosis: The 
structural knowledge of symptoms and causes. The combination of the two 
provides a probabilistic guide to current diagnosis of the particular case 
at hand. 

Eiompie Suppose an investigator observes the symptom complex s* 
given by Figure U-6a, the patient has both symptoms S(I) and S(2). 
The conditional probabilities of this symptom complex P( s’ld.) are given 
m column 3 of Figure ll-8a. Abo suppose that loioivledge of the ament 
popAtwncause complexes g,ves values of P(d,), also shown in Figure 
n-9. (These values (ot P(d,) diffe, from those shown in Figure 1Mb 
“ ''‘ S ' r " > ' P' , P ,,ha °'b « a different* 1 time.} ’ 
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<*, P(d,) 

di i i 

d* i i 

d* 1 I 

11-9 Probabilities required for calculation. The values of P(s’ 1 1 //) are taken from 
Figure 11-8a. column 3, and represent the constant relationship between symptoms 
and given causes. On the other hand, the values of P(cf,) are obtained from current 
observation of the population and will vary with time and place. 


P(d^) 




l 

1DD 


mis’) - 


D 


_1_ 
4 D 


P(dt\s*) = 


mi) 

D 


1 

12 D 


which, by observation, gives a rank order for diagnosis. Or, computing 
the denominator D as before, we have 

mis’) = A 
JW) - if 
PW) - iV 

which gives the probabilities which most add to 1.0. This result indicates 
that the cause complex d 2 is considerably more likely than the other 
possibilities, a conclusion derived from both current knowledge of cause- 
complex probability and knowledge of the constant relationship between 
symptom and cause complexes. 

A further look at Bayes’ Formula is instructive. The numerator, which 
represents the information we want for ranking cause complexes, consists 
of the two terms P(d ,) and F(s‘|d>). The data used to obtain these num- 
bers must be collected, and it is important to note that the population from 
which the collection is made in the two cases is different. 

First, in developing a set of probability values classified as symptom 
complexes for a given cause complex, as shown in Figure ll-8a for P(s'jd/), 
it is desirable to have data on as many cases as possible so that symptom 
complexes that are rarely associated with a given cause complex will show 
up in the experimental data, and therefore in the diagnostic table. In short, 
to develop tables for P(s'|d/) we pool all our experience. The greater this 
experience and the wider the diversity of conditions of observation, the 
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more faith we are likely to have in the constrained basis or conditional 
probability table that results. This is the reason, for example, that vast 
collections of symptom-cause complexes have been assembled in specialized 
medical fields, such as rare blood diseases, and have been subjected to 
mechanized treatment. 

just the opposite view must be taken in the development of the values 
r(d,) the probabilities that a given disease complex will be found in a 
particular population at a particular lime. The usefulness of Fid.) in 
Bayes Formula resides in its specificity to the patient at hand. For this 
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was done in Figure 11-5, and continue the iteration until some narrowing 
can be accomplished, or until a gross treatment can be found to cover the 
undifferentiated diagnostic possibilities. 

Thus, both skills, symptom collection and system classification, are im- 
portant to the investigator, the more important in a given case depending 
upon the statistical distributions of P(d s ) and P(s^d f ) that must be used 
for differentiation. It is, however, important to note that the distribution of 
P(dj) is seldom uniform in a general population of systems, nor identical 
for its subgroups, nor unchanging with time, so that the stock in trade of 
the professional diagnostician is a quick ability to classify systems by age, 
conditions of use, and similar characteristics of current importance. Then 
an extensive knowledge of constant symptom-cause relationships may be 
exploited most rapidly. 

There is a statistical argument and foundation for the diagnostic “guess- 
ing” ability mentioned at the start of this chapter. With a shrewd guess of 
a few highly likely cause complexes obtained by patient classification, it is 
possible, either by reference to the constrained logical basis, to tables of 
such as Figure I2-8a, or even to verbal listings or personal mem- 
ory, to select discriminatory tests to confirm the initial hypotheses. Although 
this procedure may fail in a small percentage of cases, calling for more 
formal analysis, in the main it will not, and net savings in time and effort 
will result from repeated use of this strategy. 


SUMMARY COMMENTS 

In this chapter we have discussed the organization and use of logical and 
probabilistic tables for diagnosis. We emphasized that the diagnostic proce- 
dure is often an iteration in which symptom collection, tentative diagnosis, 
and treatment follow in a circular pattern which does not necessarily termi- 
nate in one cycle. 

Although the presentation is highly simplified and reduced to apparently 
trivial examples, the logical and probabilistic manipulations indicated may 
easily be extended to more extensive arrays of symptoms and causes, and 
to more sophisticated forms of manipulation. For example, we can deal with 
matrices of logical statements, still using the designation number approach. 11 
The arguments are essentially unchanged. 

Our intent was also to stress the importance of orderly organization of 
quantitative diagnostic data, which is required for use of the techniques 
described. In many cases in which such data would be of great value, such 
as equipment maintenance by field mechanics, maintenance records and 
quantitative diagnostic data are not available or are so poorly organized as 
to he useless. Certainly, effective training of maintenance men will require 

" See R. S. Ledley, op. cit., PP- 474-487. 
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more attention to this deficiency as the systems used by industrial organiza- 
tions increase in complexity, number, and variety of use. And the advent 
of the electronic computer, which greatly facilitates the manipulations sug- 
gested, stresses the same point. Although “automatic diagnosis of systems 
problems using the storage, retrieval, and testing capacity of the computer 
is a practical possibility in a large number of cases, this capacity is rela- 
tively useless when the organized data required are missing. 

Finally, from what we have said it would also be clear that the relevance 
of the conclusions drawn from a logical or probabilistic diagnosis rests 
largely upon the accuracy with which symptoms are collected and tests are 
made. Although to some extent an iterative diagnostic-treatment procedure 
is self-correcting, because incorrect diagnoses will become evident in the 
long run, a lengthy sequence of critical tests may be worthless if even one 
of the symptoms collected is erroneous, or if a single test is positive when 
it should have been negative. The usual remedies for test malfunction, 
duplication or the application of redundant tests for check purposes, may 
alleviate this concern, but this is but another way of saying that inaccurate 
input data are expensive. There may be some economic or other justifica- 
tion in performing four inexpensive and less reliable redundant tests instead 
of one expensive sure one. In the end, however, contradictions must be 
resolved and the input must be correct before a correct conclusion can be 
drawn. 


11 1 Consider again the elevator problem of Chapter 2 (Project 2), and sup- 
po» lh, etoteto I« come to a Mop « rt>e bottom floor <md opened 
it, door. The following statement, pw i a „, , he d[ , he etevatot 

doors and the ascension o! the machine: 

A. Dispatcher starts manually 

B Weight greater than 1000 pounds 

C x.™ greifor than I, (tte for , he . 

D Door-open button depressed 
E At least one call registered 
F Doors closed 

C Safety signal on < e g , hand in door etc ) 

ir-itrftitt tarsr 5 - ™ = 

1A + B+ (C_E)I.(J5-F-C) 

SH If foS w 
— *• £ L 
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US 


A corporate spy b attesting » **»« » «™ ■*“ «*£ 

iactme this summer. He blows that it will be either a boat, s bimmower, 
or town furniture, some combination of these three, or none ci them. He 
aho knows that the following materials may (or may not) be used-wood, 
chrome, fiberglass, and brass— and decides to base Ws analysis on rnforma- 
tion gathered about these material inputs. Further he knows the foU<JW- 


tng facts: 

1. Wood is definitely used in boats, but not in lawnmowexs. 

2. Fiberglass will be used in mower frames, and maybe in boats and furni- 


ture. 

3. Brass is not used in Uwn furniture, nor in mowers. 

4. Chrome is used on boats, maybe on mowers, but not on furniture, 

5. If any of the four materials is purchased, at least one of the products 
in the above list will be made; however, if none of the materials is pur- 
chased, none of the products wdlkse made. 

A. Through diligent effort and sneaky observation, the Spy finds that the 
competitor has ordered wood and brass, but not chrome. He cannot how- 
ever, obtain any specific information about fiberglass at the moment. What 
specifically can be said about the three products at this stage? 

B. For a slight additional expense, the spy suggests that he could find out 
about the fiberglass order. Would this additional expense pay? 


115 


Richard E. Roxe of the Arizona Safety Council reports this case of an auto- 
mobile fatality, believed due to an emotional upset. A man whose wife 
had left him, who had just been fired from his job, who already bad serious 
financial problems, and who had just spent two hours in a bar drove off 
the road on a curve and was killed. (Phoenix Gazette, Arizona Days and 
Ways Magazine Section, Sunday February 13, I960, p. 9.) With these 
facts in mind, to what would you attribute the victim's death? 

The verdict of the investigator at the time was suicide. But he was 
wrong Further research established these additional facts; The dead man's 
wife had agreed to return home that afternoon, his boss had fired him 
alter learning he had a new and much better paying job, and he had drunk 
only two bottles of beer, having promised his wife he would never drink 
mot« than one bottle of beer per hour. What would these additional facts 
lead you to believe, or look for? 

The real cause of this aendent was a bahjoint front suspension. If 
had become detached. 
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causes and in many search problems. For example, in X-ray tests for 
tuberculosis or in industrial tests for faults in parts there is always the 
chance of obtaining a false positive or a false negative indication from 
a given procedure. When the incidence of faults is very small, this test "un- 
reliability can be a serious problem, as indicated in the following example. 

Suppose an X-ray procedure is used to detect faults in industrial cast- 
ings, such as small air pockets, cracks, and fissures. Let T be the fact that 
a fault is present and T be the fact that a fault is not present. Similarly 
let (+) stand for a positive test indication and ( — ) stand for a negative 
test indication. Then the four possible test-fault combinations are given 
by the table below; here the probability of each specific outcome which 
becomes a faint probability of obtaining together a specific test outcome 
and a specific fault result is shown in symbolic form. 


+ 

P(T) 
P(T) 


Now let P(+ | T) — the probability of a ( 4 - ) X-ray with no fauft = 0.02 
P(+ [ T) = the probability of a (+) X-ray if fault is present = 
0.09 

P(T) = the probability that a casting selected at random 
will have a fault = 0.01. 


p(-f) 

P(+ T) 

P(-D 

P( + T) 


*<-> P(+) 


A. Using the following conditional probability formulas 


P(+ \T) = 


P(+D 

P(T) 


and P(+\T) 


P(+T) 

PIT) 


find the joint probabilities given in the 2X2 table above. 

B. By summing across the rows in the above table, find the probability 
that a randomly selected casting will or will not have a fault. 

C. By summing down the columns in the above table, find the probability 
that a test when administered to a part selected at random will be ( — ) 
or ( + ). 

D. Show that the probability that a positive test result will be associated with 
the actual presence of a fault, namely P(T( +), is given by 0.0099/0.0297 
— 99/297 = 1/3. What is the practical implication of this result? How 
could this result be improved by a change in the testing procedure? 

E. What is the probability of obtaining a positive test result when the 
part is actually good? (This is the Type I Error of rejecting good parts.) 

F. What is the probability of getting a negative test when the part is 
actually bad? (This is the Type II Error of accepting bad parts.) If the test 
is critical, why should this number be very small? 
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same type, but ol too djEerent y ears of manufacture, 1960 sod 196a, ha'e 
be© lost The equipment is fa»w n to ha\e only two distinct n*K*J» 
failure, Fi and F z . Zither fault wffi| cause failure of a unit, w the WW 
equipment, Fj if twice as common as Fj, whereas in the 1%5 equipment 
F, is twice as common as F,. It 3s further Vnown that of the total set of 
devices on hand i were made in 1960. A single unit is selected and 
examined for mode Ft failure. If F^ is present for this unit, what is the 
probability that it was made in I960* Solution requires application of 
Hayes' Formula. [For further examples of this tjpe, see J. C. Kemeny, 
J L. Snell, and G. I~ Thompson, Introduction to Finite Mcthemctics. 
PrcnticerHaS, Englewood Cliffs, N.J., 1957, pp. 133-139.1 


pflOJECT 7 


One clause of a standard airbne contract (regarding damage) follows. 

‘(4) Subject to the foregoing: (a) liability of carrier for damages shall be 
limited to occurrences on its own line, except in case of checked baggage as to 
which the passenger also has a right of action against the first or the fast carrier. 
A earner issuing a ticket or checking baggage for carriage over the lines of others 
doe» so only as agent, (b) Carrier is not table for damage to passenger or un- 
checked baggage unless such damage is caused by the negligence of carrier." 

Su logical variables are under consideration, and will be designated by the 
letter* A through F, as follows: 


A ~ damages were in fact caused 
8 - baggage was involved 
C - baggage was checked 
D = passengers were involved 

E ~ esent occurred on own line of carrier (issuing ticket) 
F - there was negligence on the part of the carrier 


„J\" S £ apprad described in the test, create the 

bTc^ot^ “'5 “ ! T“' Uxn., create (be constrained logical b Mil 
by croMog o5 all tbe columns that arc esduded by tbe toms of the contact. 
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B. Show that the airline’s statement of nonliability (in Boolean form) is given by 

T = Z-b ABUE + AB(E + F) + AUEP + ABD 
which is to say, the carrier is not liable if there is no damage, or if there is damage 
to unchecked baggage when not on its own line or without negligence of the 
carrier, or if there is damage to unchecked baggage which occurred on its own 
line but without negligence of the carrier, or if there is no damage to passenger 
and baggage. 

C. Is the airline liable for damage of passenger and/or baggage if such damage 
occurs on its own lines? 

D. If there were more than six logical variables, how could you construct the 
constrained basis without going through the construction of the complete logical 
basis? For example, how many columns would be eliminated by throwing out all 
cases for which there was no damage at all? 

E. When there are six logical variables or less, the method of "Karnaugh Maps" 
can be used to simplified logical expressions, check on their consistency, and assure 
their completeness. (This method has not been described in this book.) Study the 
reference by B. Beizer and S. W. Leibholz, Engineering Applications of Boolean 
Algebra , cited in full in the bibliography (or any other good reference on Boolean 
Algebra or switching theory) and apply this technique to analysis of the contract 
problem above. 
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Perhaps no technique of systems anafysis has been so grossly misunderstood 
by the inexperienced investigator as the experimental activity popularly 

cabled “system simulation* 


REALISTIC SIMULATION 

To simulate, according to Webster, is to assume the appearance without the 
reahty In the same vein, a list of English synonyms for simulate includes 
feign, deceive, aSect, fake, pretend* play, falsify, fabricate, and make be- 
lieve- However, this is not what we mean when we speak of system 
simulation. 

The systems analyst seeks to construct a model or definition of a system 
that is realistic, and thus one that corresponds to reality in at least a few 
relevant particulars We discussed such definition forms in Chapters 1—8. 
However, d we arc to use the system definition for experimental tests we 
must pay attention to the creation of a model that may be varied and manip- 
ulated with ease, particularly jf the simulated system is complex. Manipu- 
lation of the model, or system definition, gives system simulation its power 
(or the ana!) rt. U is also essential that the system definition used, the 
manipulations employed, and the conclusions drawn from the manipula- 

m 
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lations be realistic, not idealistic. Skeet shooting is not duck hunting, but 
there is much to be learned from this simulated situation, because there 
are essential similarities. 


Advantages of Simulation 

The advantages of “simulating” a system can be easily summarized. The 
analyst controls many features of the system that he would not usually con- 
trol in practice, and can therefore relate known manipulations to known 
results. In addition, he controls the time and cost scales. Many repeated 
incidents that would take years to experience in reality can be simulated in 
minutes or seconds. The cost of obtaining such simulated experience is 
usually much less, and frequently much safer than experimentation with the 
real object. Thus, training pilots in Link Trainers and even highly sophisti- 
cated and expensive jet airliner cockpit mockups that include vibration, 
noise, and view, is far less expensive and much safer than similar training 
with real equipment. Similar benefits can also occur with changes in other 
scales, such as size. With reduced or expanded size, the system can be 
brought to a human scale, where it can be analyzed more easily. 

Simulation also permits the experimental testing of hypothetical condi- 
tions that do not exist as yet, so that projections concerning the operation 
of new equipment, or present equipment under extreme conditions can be 
made. For example, the System Training Program for radar operators in 
Europe enables allied military personnel to train against simulated attack- 
ing aircraft attempting to penetrate their defense from behind the Iron 
Curtain. Clearly allied training flights cannot penetrate Free Europe from 
this direction. 1 

These advantages are so compelling, that a wide variety of simulations, 
and simulation media have been used. 

To illustrate the range of possibilities, we discuss four approaches, 
which, of course, may be used in various combinations. 

Manual Simulation 

The ancient “map game” is an example of manual simulation. Played by 
teams of military officers using a map and movable pieces representing 
opposing troops, this chess-like exercise has often been used for training in 
strategy, for testing proposed tactics, or for appraising topological and 
logistic constraints imposed by the terrain. Following the advice of Machia- 
velli that the Prince should train himself, as he rode through his kingdom, to 

1 “Simulation,” Technology Series, Corporate Communication BRT-I2, System Develop- 
ment Corporation, Santa Monica, Calif. 1905. The outline of simulation types dis- 
cussed above follows that suggested in this publication. 
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m 

_ on 

-wvw- 
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a 


F = ma 
( 2)f - kx 

@* = if)» 

(a) 



(7) F = (p/U)4 
@ F = 2/>p S 4 

®* = <5* 


(b) 



© e = *» = *■ s 
©S-'S-*» 

® i = cf -out 

® / = (CT)a 


(C) 


12-1 Analogous forms of accelerometers or devices to measure acceleration, (a) A 
mechanical accelerometer; (b) a hydraulic accelerometer; (c) an electrical acceler- 
ometer. The equations describing these devices are explained in the text. (For additional 
examples, see D. W. Ver Planck and B. R. Teare, Engineering Analysis, Wiley, New 
York, 1958, an Introduction to problem formulation in equipment design ) 


displaced a distance x to the left, relative to the frame, stretching the 
spring. The amount of this displacement is determined by the opposing 
forces set up by the acceleration. A knowledge of physics tells us that the 
mass will exert a force (tending to stretch the spring) F = ma, where 
the units of F, m, and a are consistent. (For simplicity, we will elimi- 
nate the units here, because they provide only a scale factor. ) Conversely, 
the spring resists stretching with a force F = kx. Because these two 
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opposing forces must be equal, ma = kx, and the displacement is x = 
(mlk)a. In other words, the accelerometer may be made more sensitive 
by increasing the size of m , or decreasing the stiffness of the spring and 
therefore its constant k. 

In Figure 12-lb a glass tube of cross-sectional area A is filled with a 
liquid of density p. At rest, this liquid will reach the same height in each 
leg of the “U-shaped” bent tube. However, if the assembly is accelerated 
to the right, the liquid will rise in the trailing leg and fall in the leading 
leg. The displacement h of the liquid from its "at rest" position may 
again be found by analysis of opposing forces. First, the force caused 
by the acceleration will be that acting upon the liquid in the horizontal 
member, labeled L in the drawing to indicate its length. This force is 
again determined by the mass of liquid in this member, namely P AL, and 
the acceleration, so that F = (pAL)a, where P is the density of the 
liquid, A is the cross-sectional area of the tube, and a is the acceleration 
(all terms are expressed in consistent units). Similarly, the force due to 
the unequal height of the liquid in the vertical legs is F- 2 JipgA, where g 
represents the ^acceleration due to gravity, which tends to pull the liquid 
to the at rest positron. The two forces must be equal. When the equa- 
tions are set equal to each other and solved for h, sve find h = (L/2g)o 
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that will flow through a capacitor is proportional to the time rate of 
change of voltage, in particular, i ~ C(dE/dt). By substitution, we then 
have i = (Cfd)a. The ammeter A that measures the current i may thus 
be calibrated in units of acceleration a. To increase the sensitivity of the 
electrical accelerometer we may either increase C, or increase the propor- 
tionality constant Id in E = Idv by an alternate choice of tachometers. 
Note also that we have our choice of analogies between the electrical 
accelerometer and the previous examples. For example, we may choose 
to relate the capacitor C to the mass m in the mechanical example. In 
this case, the stiffening of the mechanical spring (larger k) would corre- 
spond to smaller proportionality Id in the tachometer. Conversely, we 
may relate the capacitor to the spring of the mechanical example. Then, 
the increases of the mass m in the mechanical example would correspond 
to an increase in the proportionality factor Jd in the electrical device. The 
mathematical relationship is not affected by what we choose to relate 
for experimentation. So long as the equations describing two systems are 
identical, the systems are analogous; we may then choose any convenient 
component analogy within that framework. 

By combining analogous components, we can construct many analogous 
systems. For example, we can use electrical amplifiers as an analog for 
mechanical gear trains that multiply angular rotation. We can also introduce 
simple logical elements in the construction of analogous systems: A discrete 
mechanical device, such as a lever that selects a given motion, is analogous 
to an electrical switch. The possibilities are endless, and in the past twenty 
years great ingenuity has been used in the construction of analogous com- 
ponents and systems to ease systems analysis. For many purposes, however, 
electrical analogies have been preferred because of their ease of construction 
and manipulation and their low cost.* 

Analog simulations also allow simultaneous measurement of system 
values in many points in the system, which may be a distinct advantage or 
requirement in hardware system design. Physical simulations, however, 
are limited in their precision by the physical precision of their mechanical, 
electrical. Or other components. Usually, precision to two or three significant 
figures, which may or may not be satisfactory for a given application, re- 
sults. (Digital computers, as we shall see, avoid this precision limitation.) 


'This is the basis for the analog computer, which is a general purpose simulator con- 
sisting of electrical components that may be hooked up in various ways to simulate a 
particular system Such equipment can also be used to solve mathematical problems. 
The first differential analyzer, constructed by Dr. V. Bush during World War II, was 
an analog device made from mechanical gears, wheels, clutches, and friction discs. 
These components performed the mechanical equivalent of mathematical differentia- 
tion, integration, and similar operations. The modem electrical computer does the same 
thing, but it uses electrical components. 
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Symbolic Simulation 

When a system definition can be reduced to a mathematical formula, the 
system can be simulated by an analog or by a digital computer. However, 
systems defined by a complex series of logical statements are best handled 
by a digital machine. The digital computer also permits extensive files of 
subroutines and, it allows databanks to be introduced into the simulation 
of systems. Finally, because numbers rather than physical quantities are 
manipulated, the precision available with simulation on the digital com- 
puter is not limited by the physical precision of components used in the 
computer's construction. 

The digital computer can usually perform all the tasks of the analog 
computer; it also permits greater flexibility in system simulation. Therefore, 
although it is more expensive than the analog computer (which is used most 
often in engineering and design work), the digital computer is the first 
choice for the highly conditional business systems that require a combina- 
tion of great logical flexibility and precision. Indeed, science and engineering 
may require precisely these abilities. The wind tunnel, an analog device, is 
still used, but today it is commonplace to “fly” a new aircraft or missile 
design on a digital computer, often to save time and money in the con- 
struction of physical models and to test behavior that cannot be evaluated 
in a wind tunnel or similar physical simulator, e.g., satellite re-entry. 

By using the digital computer, some ear/ier forms of manual simulation, 
such as war gaming, having been greatly expanded in complexity and 
realism. For example, we can construct war games that include international 
rather than local complications. 

Moreover, we can combine analog and digital simulation advantages 
by selected use of analog devices and digital capability. Thus, the flight 
simulator cockpit presents the trainee with a number of analog input and 
output devices, and analog computer devices can control simulated operat- 
ing conditions. But digital capability expands the situation to include 
certain “flight conditions” external to the man-machine interface, such as 
weather conditions and flight instructions. 

The digital computer works with symbols, and the logical requirements 
of symbolic simulation are substantially greater than those of earlier forms 
of simulation. Every system relationship and inter-relationship must be pre- 
cisely spelled out. Special techniques have been developed to simulate 
physical situations that are foreign to everyday experience. Because the 
digital computer essentially works sequentially, making one computation 
or one decision at a time, extensive control routines are required. However, 
these disadvantages cannot often withstand the overwhelming advantages 
of flexibility, speed, and precision offered by digital equipment. Therefore, 
we wifi devote the latter portion of this chapter to some of the details of 
this form of system simulation. 
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Operational Simulation 

Operational simulation, so-called to distinguish it from the forms previously 

discussed, is another variant of simulation. statements 

Factors that ate irreducible to quantitative terms or logical statements 
play aXortan. part in many systems. Only if qualitative factors arc 
deliberately included in such systems is the completeness of the ™“ !a “ n 
assured. An algorithmic simulation may fail because of the exclusion 
highly variable human intervention factors, or other political or socta 
factors. A given simulation problem must be met by applying the appropna e 
simulation discipline. . 

We accomplish this by using simulation within operating environments, 
in which human participants use their judgment and other human abilities 
to interact with the simulated system. The simulated system, when provided 
with any necessary "embedding conditions,’' such as input data, environmen- 
tal simulation, and sensory stimuli, evaluates the responses of the experi- 
mental subjects and feeds back this information. In its most obvious form, 
this is the classic Link Trainer. However, the same idea may be extended 
to very large systems. For example, a group of planners may be supplied 
with industrial input-output data describing a country’s economy. The plan- 
ners may be told that a bomb has been dropped on a given location and 
they may be given specific damage reports on the industry in that city, 
together with nation-wide projections of the local disaster by economic 
structure tables. The planners may then be asked to develop emergency 
orders for the diversion of shipments from the peace-time pattern so that 
essential industries can be maintained. The simulated "economy” then eval- 
uates these plans and feeds back the new status to the planners. A series of 
these cycles not only permits a greater understanding of the inter-relation- 
ships in the economy, but also aids in the development and evaluation of 
emergency plans 

Similarly, large-scale communications systems, such as air traffic control 
and early warning defense systems, that involve human operators as well 
“ < f cns,vc hardware offer another opportunity for operational simulation. 
Both operator reliability and skill and equipment effectiveness are tested 
by this means. 

Although an electronic computer is not always necessary, the more 
comprehensive forms of operational simulation would be impossible with- 
out the aid of a digrtal computer. For example, an evaluation of the SAGE 
sjstem ior early warning using simulation, as would he the case with any 

XnlTr" 7 >»*«■. involves staggering amounts of data 

and computation In simulated lest, of the SACE system, 26 computers were 

*7 ““h”' * ch co " Mhav ‘! processed— (in just 24 hours)-all annual 
ioottic tar returns Sled the United States. Without these powerful central 
processors, operations simulation could not have been used in SAGE; how- 
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ever, lesser forms of simulation could not have met the critical training and 
evaluation problems of this system. 5 

Standard Computer Programs and 
Operations Research Models as Simulators 

'Hius the present state of the art usually associates large-scale system simula- 
tions with an analog computer, or, even more frequently, with a digital com- 
puter. The digital computer has another advantage over the analog compu- 
ter in the exploitation of the stored program and stored data. 

Unlike the analog computer, which must be set up for each different 
simulation project, the digital computer is set up by the construction or 
selection of a program of instructions, which may then be stored permanently 
for later use. In this way a large range of available programs for simulat- 
ing different systems may be collected, j'ust as computer programs are 
collected for solving repetitive problems. The same argument holds for the 
data accumulated for a simulation experiment. These data are not lost, and 
are easily recorded and stored for later use. 

For this reason it is often possible to modify or piece together a new 
simulation “program” from blocks or segments of old computer programs, 
by using these formerly constructed blocks as subroutines. This is not only 
beneficial strategy to the analyst, but also useful in creating an accumulated 
knowledge of simulation technology. 

Moreover, the stored program concept calls to our attention the fact 
that presently available library programs, as they stand, may be used as 
simulators in many cases. In addition to many "general purpose” simulators 
that by parameter specification can simulate specific systems (such as in- 
ventory control installations, job shops, critical path networks, and the like). 
Standard computer algorithms, or programs, can also be used. 

Thus, a standard program for solving linear programing problems is, 
in effect, a ready-made simulator for testing hypothetical conditions or 
variants in current practice. Such programs are widely used for this pur- 
pose, although often their users do not speak of system simulation when 
conducting such experiments, but rather use more specific descriptions, such 
as "parametric programing,” to indicate the more specialized application of 
their work. 


SOME CONSTRAINTS ON SIMULATION AND ITS USE 

Because of the glamor and the potential of large-scale simulation studies 
using the digital computer, many investigators have fallen into severe pit- 
falls by indiscriminate use of this approach. 

Simulation is a means to an end, not an end in itself. The purpose of 

• Example quoted in "Simulation," Systems Development Corporation, he. cit. p. 6. 
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constructing a simulation experiment is to solve a specific problem, to 
investigate the characteristics of a specific system, to train operators in a 
given skill, or to evaluate given alternatives. In some cases, a simulation may 
be useful in exploratory investigations, or in a few cases the construction 
of a simulation “to see if it can be done” is an end in itself for educational 
purposes. But, by and large, to embark upon a large-scale system simulation 
without clear objectives in mind is to invite disaster. 

First, there may be many easier ways to solve a particular problem than 
by simulation. The solution of an equation may be all that is required for 
a specific problem. Often a simple manual simulation will suffice. 


Second, in absolute terms, digital simulation of a complex system is 
expensive. Although such a simulation may repay its own cost many times 
over, it is expensive to design, to develop, and to maintain. Not only must 
the system be defined in great detail, but vast amounts of specifications and 
environmental data must be collected and organized. Often extensive com- 
puter programs must be written and debugged. Testing and perfecting the 
simulation programs may me much computer time before the programs 
ate ever used to obtain experimental results. In operational simulation 
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use J7 7' m n ” ded ' if the economic 

of riml b J* n0t , te P* in m >nd, and « the technical elegance 
hZJ ‘ “ lively controlled, cost can quickly become pro- 

^ susr ? ■ ^:“ 0 b ran " 

essential ingr^drats’thlt’valiitfar 15 Tr' 3 * 10 '" ran ako obscure the 
cm*™ to reality uS JTOlUbon, i.e„ those that cause it to 

detemnne the crihe.1 Crt'" “ '° , bc “ afe We ■”“* 
tmnale for selection of these features ™ 1 ”“ ented and "“he clear the ra- 
il the investigator attempts to inrt t ^ 1° success can be muddled 
the simulation, and he svill a j wav U i, °° f rJc b undifferentiated detail in 

belief that compute “f !“ <, ° ** ■» *<= 

can usually process the detail often tK ■ 1 easdy ’ Although the computer 

that he produces a s y ^ ""'“‘gator «, with the result 

uncr editable results emerge that he m 1x .. aamot ' m,J erstand. Weird and 
*he innumerable internal interactions hT*, 0301101 e *pkin because of 
, Another pitfall * forcing n ,eft t0 the machioe * 

ta the system at hand. Fofexample Wi “ “ ? cot !»s*ified or possible 
constructed by eliminating all n^ato,-" 5 ' ’>™bohc simulations have been 
on system factors that can ta e ^«s S ed? “ “ d “"^bating only 

ltd' “ alsQ tr “ e ™ “ny mathnltiea| Ua h n a UhVe tenm for cont- 
end management science, hot the urge to -? ’ *" °P<«Uous research 

8 ° etanmte anything that cannet be 



Some Constraints on Simulation and Its Use 369 


reduced to a sequence of logical statements is even greater in digital simula- 
tion because of the computers logical demands. If the techniques to repre- 
sent some object, event, or condition are not available, the simulation should 
be dropped rather than compromising or simplifying assumptions, because 
the results obtained will be useless, or worse, misleading. Here another 
approach, although less "precise,” would be more relevant and valid for the 
problem. 

Next, the computing and data-handling ability of the digital computer 
places a serious burden on the investigator in his specification of the experi- 
mental tests and experimental results he will require. The most common 
pitfall in computer simulation is to "try everything possible" and to “print out 
everything possible.” Because for complex simulations the digital computer 
can try literally billions of possible input conditions and print out literally 
mountains of data, careful selection of input conditions and the specific 
output required is a necessity that greatly tests the investigator’s skill. If he 
does not have a clear understanding of the system he is to simulate, the 
investigator is not likely to make a very good selection of the input condi- 
tions to be tried or to anticipate the summaries and analyses of the output 
that will be valuable to him. The picture of the analyst surrounded by piles 
of computer output forms, searching through this maze for something 
worthwhile, is not only humorous, but also pathetic, frustrating, and expen- 
sive. Unfortunately, the picture is not at all uncommon. In scientific and 
engineering applications, where the analyst usually understands the "phy- 
sics” of the problem, unrealistic results are more easily screened out, But 
in business applications, great care is required because of the looser problem 
structure and the increased amount of detail that often enters the simulation 
by necessity. 

Finally, if the simulation is to take place on a "real-time” basis or faster, 
as is required in the evaluation of many process control and military appli- 
cations, the abilities of the computer may be strained if the system simula- 
tion is too complex. For example, in air traffic control, to be of any opera- 
tional use, an evaluation of new input conditions may be required in seconds 
or fractions of a second. Although the computer may be able to perform 
rudimentary operations in billionths of a second, the computations required 
in the evaluation still consume a finite amount of time, and, if too numerous, 
may exceed the response time required for effectiveness. A similar argument 
holds for experimental work in which a large number of replications must 
be carried out for statistical evaluation of given input conditions. 

Within these constraints, however, there still remains a wide range of 
possible applications for digital simulation of complex systems. With this 
background we will now turn to some of the details of system simulation by 
computer. For simplicity, however, our approach will be via the route of 
manual simulation, which may be extended to the computer. The principles 
required are best illustrated in this way. 
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Table 12.1 Data ior Simulation Example 


1. Hem routing 


2. Machine data 


Item Route If inspection fails, 
total route becomes 

A 1-4-3 1-4-3- 2-3 

B 14-W 14-2-3-2-3 

C 1-2-3 1-2-3-2-3 

D 14-3 (Scrap, rework not possible) 

Note: l£ material is rejected at inspection it is 
returned to the Saw and Straighten Operation, ex- 
cept for product D, which is scrapped. Assume 
that the reworked material always passes the 
second inspection. 


Machine Setup time (hours) 

1 1.00 

2 2.00 

3 0.00 

4 LOO 


Running time 

(hours/unit) 

0.01 

0.01 

0.01 

1.00 per batch 


Note: Machine 4, the furnace, works on a batch 
process only. A setup is required before each 
batch. A batch can be less than hut not more 
than 250 units. 


3. Inspection data The inspection step. Operation 3, rejects 20 X of 

the units passing through it, and units rejected 
are reworked or scrapped according to the rout- 
ing data above. 


4. Move time 


Each move of an order between machines takes 
1.00 hours. No job is moved until it is completely 
finished at a given step. This includes the batch 
production at machine 4. No partial Jots are moved 
unto the entire job is complete. 


EXAMPLE OF SIMULATION TECHNIQUE 

sider a man' 11 '?'?' ' tt<: delail3 sys,ni ’ sta ™latron by an example. Con- 

““T 8 the macl ’ ines and Bows *""> fa Figure 
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5. Priority of work 
in case of conflict 

Priority determined by last work center, as shown 
in Figure 12-3 by a, b, c. Runs are not broken to 
observe priorities, but they are observed at change 
time. 

6. Workload 

Item Units required to finish Units started 

A 500 units 500 units 

B 500 units 500 units 

C 100 units 100 units 

D 100 units 125 units 

7. Schedule 

The work begins in the order A-B-C-D on Ma- 
chine 1, and thereafter follows the specified rout- 
ing. 

8. Production lot sizes 

The total number of units reguired is made for 
each item at one time. Runs are not broken for 
priorities, or to make work go through the shop 
faster. 

9. Interrelated items 
or setups 

There are none. 

10. Internal shop 
communications 

There is no communication between the work 
centers , so that one work center cannot anticipate 
the j'obs arriving from others. Thus, setup can- 
not start at a given work center until alter a given 
job arrives at its location. 

11. Initial conditions 

At time t = 0, assume that the shop is empty, 
that is, that there is no work in process anywhere 
in this shop. 

12. End of simulation 

The simulation is to stop when all of the items 
scheduled, including material reworked, has been 
completed in the shop. At this point the shop 
will be empty, and no work will remain for any 
of the machines. 


illustrated by the table of data reguired for even this simple problem, as 
shown in Table 12.1. The data of Table 12.1 have been developed for 
illustration and do not reflect the actual data from an extrusion plant. To 
simulate a particular system, field data representative of the given system 
must be collected. Table 12.1 also indicates a number of production con- 
ventions that must be specified in detail, and are here developed arbitrarily 
for test purposes. For example, the priority of work at each machine is to 
be determined by the last work station that processed a job rather than by 
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EXTRUSION 
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12-3 A block diagram of a simple shop to be simulated. For a specific example. 
Machine 1 Is 8n extrusion press. Machine 2 Is a saw and straightening operation. 
Machine 3 is an inspection operation, and Machine 4 Is a furnace for heat treatment 
of the extruded metal. This collection of steps is common In the manufacture of ex- 
truded shapes, such as aluminum window frames. The data required for simulating 
this shop ere provided in Table 12.t, 


some other mle, it is assumed that there are no internal communications 
brtivecn machine in (lie shop, and a shirting schedule and initial conditions 
are specified 

Some of the complexity of this simple illustration is evident when we 
ponder the number of alternative schedules, initial conditions, priority rules, 
work loads and product mixes, machine capabilities, and the like, which 
EaCh COTceivable I"™ the specified condi- 
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12-5 An estimation problem using the Monte Carlo Method. The rectangular area 
ft Is known. The problem Is Jo estimate the Irregular area A. By the Monte Carlo 
Method a3 described in the text, a random sample ot squares In tho grid permits 
estimation ol the ratio A/ft, and therefore A, because R is known. 

approach, we may proceed as follows. First, divide the vertical and 
horizontal axes ol R into equal segments to produce a grid. For con- 
venience, let the grid have ten by ten divisions, producing 1G0 boxes, 
each yV» the area of R. Number each row i = 0, 1, 2, 3, . • 9, and each 
column r = 0, 1, 2, 3„ . . ., 9. Each small box may then be indexed by the 
two numbers i, j. We will now randomly select a small box i, f, AS shown 
below, and record a "yes’’ result if any portion of the irregular figure 
enters the box selected, and a "no” otherwise, (This decision rule will 
tend to overestimate the size of A, but the error may be minimized by 
using a larger number ot scale divisions. The procedure is not aSected by 
this problem.) 

To select the sample boxes a number of procedures could be used. We 
could place all the numbers 0, 0 through 9, 9 in a hat and draw them 
out. A more efficient way is to use a random number table, which is a 
tabulation of digits each of which is equally likely (he,, each decimal 
digit has a ^ probability of appearing in the table), and each of which 
is distributed uniformly throughout the table so that no sequential pattern 
of occurrence can be found. Table 12.2 shows such a table of random 
numbers. 
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Table 12.2 Random Numbers' 


09 

73 

25 

33 

70 

53 

01 

35 

86 

54 

20 

48 

05 

64 

89 

47 

42 

96 

42 

26 

89 

53 

19 

64 

50 

93 

03 

01 

90 

25 

29 

09 

37 

67 

07 

15 

80 

79 

99 

70 

80 

15 

73 

61 

47 

06 

57 

47 

17 

34 

07 

27 

68 

50 

08 

01 

08 

05 

45 

57 

18 

24 

06 

26 

97 

76 

02 

02 

05 

16 

58 

92 

57 

33 

21 

35 

05 

32 

54 

70 

48 

79 

64 

57 

53 

03 

52 

96 

47 

78 

52 

01 

77 

67 

14 

90 

50 

86 

07 

80 

50 

54 

31 

39 

80 

82 

77 

32 

45 

29 

90 

34 

00 

28 

89 

80 

83 

68 

34 

02 

00 

86 

50 

75 

84 

01 

59 

46 

73 

48 

87 

51 

70 

49 

69 

48 

11 

70 

74 

17 

46 

85 

09 

50 

12 

43 

50 

35 

17 

72 

70 

80 

15 

35 

09 

98 

17 

77 

40 

27 

72 

14 

91 

62 

68 

03 

60 

25 

22 

91 

48 

89 

32 

05 

05 

14 

22 

50 

85 

14 

49 

91 

45 

23 

68 

47 

92 

76 

86 

33 

69 

45 

98 

26 

94 

03 

68 

58 

10 

48 

19 

49 

85 

15 

74 

79 

54 

55 

07 

37 

42 

11 

10 

00 

20 

40 

60 

64 

93 

29 

16 

50 

53 

44 

84 

19 

69 

04 

46 

26 

45 

74 

77 

74 

47 

44 

52 

66 

95 

27 

07 

99 

53 

55 

72 

85 

73 

67 

89 

75 

43 

87 

48 

11 

62 

13 

97 

34 

40 

87 

21 

52 

37 

83 

17 

73 

20 

88 

98 

37 

49 

35 

24 

9-1 

75 

24 

63 

38 

24 

54 

99 

70 

54 

64 

05 

18 

81 

59 

98 

31 

53 

07 

20 

89 

80 

93 

54 

80 

80 

83 

91 

45 

42 

72 

68 

42 

05 

88 

52 

36 

01 

39 

09 

22 

86 

17 

90 

02 

97 

87 

37 

92 

52 

41 

23 

46 

14 

06 

20 

11 

74 

52 

04 

56 

54 

14 

30 

01 

75 

87 

53 

79 

15 

51 

49 

38 

19 

47 

60 

72 

40 

86 

43 

19 

94 

36 

16 

81 

08 

5L 

08 

62 

48 

26 

45 

24 

02 

84 

04 

18 

51 

62 

32 

41 

94 

15 

09 

49 

95 

10 

04 

06 

96 

38 

27 

07 

74 


34 67 35 48 76 
24 80 52 40 37 
23 20 90 25 60 
38 31 13 11 65 
64 03 23 66 53 


SO 65 90 
20 03 61 
15 95 33 
88 67 87 
98 95 11 


90 17 39 
04 02 00 
47 64 35 
43 97 04 
68 77 12 


29 27 49 
82 29 16 
08 03 38 
43 82 78 
17 17 68 


38 69 73 81 70 
35 30 34 26 14 
68 60 57 48 18 

90 55 35 75 48 

35 80 83 42 82 

22 10 94 05 58 

50 72 58 82 48 
13 74 67 00 78 

36 76 66 79 51 

91 82 60 89 28 

58 04 77 69 74 

45 31 82 23 74 
43 23 60 02 10 
36 93 68 72 03 

46 42 75 87 88 

46 18 28 35 54 
70 29 73 41 35 

32 97 92 65 75 
12 86 07 46 97 
40 21 95 25 63 

51 92 43 37 29 

59 36 78 38 48 
54 62 24 44 31 
16 88 84 87 67 
68 93 59 14 18 

45 86 25 10 25 
96 11 96 38 96 

33 35 13 54 62 
83 60 94 97 00 
77 28 14 40 77 

05 56 70 70 07 
15 95 66 00 00 
40 41 92 15 85 

43 88 79 45 43 

34 88 88 15 53 

44 99 90 88 96 
89 43 54 85 81 
20 15 12 33 87 


65 81 33 93 85 
86 79 90 74 39 
73 05 38 52 47 

28 46 82 87 09 
60 93 52 03 44 

60 97 09 34 33 

29 40 52 42 01 
18 47 54 06 10 

90 38 47 64 93 

93 78 56 13 68 

73 03 95 71 80 
21 11 57 82 53 
45 52 16 42 37 

76 62 11 39 90 
96 29 77 88 22 

94 75 08 99 23 

53 14 03 33 40 
57 60 04 08 81 
96 64 48 94 39 
43 65 17 70 82 

65 39 45 95 93 
82 39 61 01 18 

91 19 04 25 92 
02 07 11 20 59 
26 25 22 96 63 

61 96 27 93 35 

54 69 28 23 91 

77 97 45 00 24 
13 02 12 48 92 
93 91 08 38 47 

86 74 31 71 57 
18 74 39 24 23 
60 67 43 68 06 
59 04 79 00 33 
01 54 03 54 58 
39 09 47 34 07 . 
88 69 54 19 94 
25 01 62 52 98 : 


11 19 92 91 
23 40 30 97 
18 62 38 85 
82 49 12 58 
35 27 38 84 

50 50 07 39 
52 77 58 78 
68 71 17 78 
29 60 91 01 
23 47 83 41 

40 21 81 65 
14 38 55 37 
90 28 60 26 
94 40 05 64 
54 38 21 45 

37 08 92 00 
42 05 08 23 

22 22 20 64 
28 70 72 58 
07 20 73 17 

42 58 26 05 
33 21 15 94 
92 92 74 59 
25 70 14 66 
05 52 28 25 

65 33 71 24 

23 28 72 95 
90 10 33 93 
78 56 52 01 
70 61 74 29 

85 39 41 18 
97 11 89 63 
84 90 28 52 
20 82 66 85 
05 01 45 11 

35 44 13 18 
37 54 87 30 
94 62 46 11 


. „ , _ T . - t -<r and E. L. Araoff, Editors, Introduction to Opera • 

• From C. \V. Churchman, R. t.. ‘jje 

Hen, Research, Wiley, New VoA 1957, p. 
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The procedure For using this table is to start at any randomly chosen 
starting point, reading off the sequence of digits that follow any con- 
sistent pattern. (Row or column reading is most usual.) For example, 
in Table 12.2, if we start for simplicity at the upper left-hand comer, 
the first two-digit random number (columns 1 and 2) is 09. Reading 
down, the next random numbers are 54, 42, 01, 80, etc. {After arriving 
at the end of the first two-digit column we could move to the top of the 
next column and so on.) 

Now to continue with the estimation problem, we decide upon a sam- 
ple size and choose a two-digit random number for each sample box se- 
lected, Suppose at first we arbitrarily take a sample of n = 10 boxes, using 
the first ten numbers of the random number table starting with 09. Let 
the first digit represent the value of i and the second that of so that a 
two-digit number will select a small box from the area R. By inspection of 
Figure 12-5, record a "yes" or a “ 00 “ for each of the ten samples, as 
below. 


Random Number 

Result 

09 

No 

54 

Yes 

42 

Yes 

01 

No 

80 

No 


Random Number 

Result 

06 

Yes 

06 

Yes 

26 

Yes 

57 

Yes 

79 

Yes 


To complete the estimate, compute the fraction p of “yes” answers in the 
total 


— ( number of “yes” answers ) 

(sample size n). 

Because p is an estimate of the fraction of R occupied by A, we have as 
an estimate for A simply A = pH. Here the estimate of A for the sample 
chosen is A - 0.7H. If area H is ICO square centimeters, then our esth 
mate of A from the simulation is 70 square centimeters. 

The error in p may be found by recalling that the yes-no results are 
Bernoulli trals, and that the results evil] follow the binomial distribution. 
From thu know ledge the formula for the standard deviation of p is 
A/Hfo?p.^W ' P U thC ,rU ' '' a,US P = A/R ■ Substituting 
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from which it may be seen that the estimate of p (and therefore of A ) 
may be improved by increasing the sample size n, or by making A a large 
part of fl, (R is an arbitrary boundary set up for the sampling proce- 
dure.) Conversely, given R and a desired value of a p> the sample size 
required to achieve this result may be obtained from the same formula, 1 

We now consider a different problem using the Monte Carlo approach 
to estimate a survival probability. 

Example: The game of Russian Roulette is played with a six-shooter 
hand pistol. One of the six chambers is loaded, the chambers are spun, 
and one participant points the pistol at his head and pulls the trigger. 
If we assume the spin has randomly placed the loaded cartridge, the 
probability of survival for one trial is #, and the probability of death 
is £. What is the probability of survival after three trials, if we assume 
the chamber is rotated at random at each trial? (It is convenient to speak 
of a probability of survival rather than a probability of death because 
the participant must survive all three trials to survive this game.) 

Applying the Multiplication Law of Probability we find (#)* = 
125/216, or a little better than one-half. How could we have obtained 
this result by sampling? 

We now introduce the concept of coding the random numbers drawn 
from a random-number table. Because the probability of death on one 
trial is | or 0.167, and the probability of survival on one trial is |, or 
0.833, we may code the states Die = D and Live = L, rounding to two 
figures, as 

Random numbers Outcome Fraction of 100 Random No. 


00-16 D i 

17-99 L i 

If the random numbers are drawn from the table as before, and if they 
are interpreted to mean L or D as in the above table, then we have con- 
verted the random-number table to a table of L s and D s in which the 
Ds will be i of the total number and the L’s will be i. 

The Russian Roulette game may now be simulated by developing a 
table listing a number of experiments as rows, each of which consists of 
three trials. The table is filled-in using the random numbers in their 
equivalent interpretation L or D, and noting that the participant will 

'Because the confidence interval for large and small values of p will not be symmetrical 
for the binomial distribution, a more accurate determination of a sample size for a 
given confidence interval requires consultation of tables of the binomial distribution. 
The formula above assumes that the sample size is sufficiently large so that the normal 
distribution can be used to approximate the binomial. 



332 System Simufatlon 


F^'i in the ou,con,c L - L 

the estimate of the survival mjfaM h T n d CJr P erime " ts - To obtai, 
ratio of "yes" r “ U te to 3 f ,Cr "T Wa ><. «o compote lb, 

obtain f f 0r fte len simulated e!^erimen“' to r” 1 b' HS<B ? 12 '®’ 

information we could use the Sue of n - i ^ absence of an >’ 0,llel 
the true value and compute the stand, f 7 *. “ an a PP™*nation to 
in the previous example. The experimmf . dc ™ tl0n of estimate, as 
123/2.6 as the number "'° M “ me ” rarer 


and ask how w™ SnSeS'a'Som n'Jtbe “f '" 8 pr0CKlure i“t described 
ontcomes from a given prX^ ttSST”* ‘ ^ °‘ ^ 


part m 4 field is gi ve „ by the protobilbj • dbMbutira * giVC " 

x ~ No. o f failures/weelc 

P(r) 


i*®*, in 10 weeks out of Iflo it 

one, in 20 weeks there were twu'Tnd to 30 wecb U,ere w “ 

more fadures in a week have not be«^ 10 "“H ,heIe lI ' re< ’- •"»»' m 
« - <o generate a 


*•**”•« "andom 


1 

2 

a 

4 

5 

6 

7 

8 
9 

10 


*" or D Survival? 


°9» 54, 42 
01, 80, 06 
°6. 26, 57 
79 . 52 , 80 
45, 68, 59 
48, 1 2 , 35 
®1. 89, 49 

33, 10, 55 
60, 19, 47 
53, 48, 52 


d.l,l 

D,L,L 
D, L,L 
L,L,L 
l,l,l 
l,d,i 
UKL 
l,d,l 
l,l,l 

— > ■* t, » oz L L L 
Estimated Probability of Survival = Ntln.be, nl -v. 


No 

No 

No 

Yes 

Yes 

No 

Yes 

No 

Yes 

Yes 


= ^SSS^L 2es- Benda s 

ta-a Simulation of a Russian Ft 01 Expenments to~T 

exnJrf HCal answef 125/2J6 « re *. Tha e *Per«mentai estimated ® xpe, ‘ menl consists 

-~,.e, TOa , i „ 8no 0^r« = may fce 
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by the probability distribution shown below (which has an average 
of 1.0) 

Move Time x ( h ours) 0.5 1.0 15 2.0 

F(x) 0.4 0.3 0.2 0.1 

Then, each time a move occurred in the job-shop simulation we could 
consult a random-number table, pick the next random number, and find 
the equivalent move time to use in the simulation. This randomness m 
move time would produce a distribution of results for the simulation, 
say for the total completion time. With this probabilistic element in the 
simulation, we must repeat the simulation procedure a large number of 
times to obtain an estimate of the average total completion time. The 
number of repetitions required for a given precision of estimation would 
depend upon the dispersion of sample results obtained in the simulated 
total completion times. 

The reader should not forget that, in addition to coding discrete variables, 
as shown above, the same procedures can be used to code and generate 
attributes at random. Thus, if a light can be red, yellow, green, or out with 
given probabilities, it would be possible to generate a series of R, V, G or O 
outcomes with the required probabilities using the same methods as before. 

Sampling From Continuous Distributions 

With little difficulty, the above methods can be extended to continuous dis- 
tributions, as shown in Figure 12-7. First the continuous distribution is con- 
verted into a cumulative probability distribution. Graphically, the horizontal 
distribution represents the range of the random variable to be described, as 
in Figure I2-7a. The vertical scale has 100 equal divisions if two-digit ran- 
dom numbers are to be coded, 1000 equal divisions for three-digit coding, 
and so on, and each division is numbered from 00-99, reading up from the 
origin. This procedure, in effect, approximates the continuous curve with 
a discrete distribution having 100 steps. To generate the random variable, 
we again look up a two-digit random number, enter the left-hand scale of 
the cumulative distribution at that interval, and read across and down, to 
obtain the corresponding value of the random variable. The required coding 
is provided by the relation of the equal intervals to the left and the contin- 
uous cumulative probability curve. We can handle any continuous distribu- 
tion in this manner. 

Of special importance is the normal probability distribution, which is 
frequently encountered ui simulation work, and which is, of course, con- 
tinuous. The same procedure holds, except that it is worthwhile to note that 
by using the standard cumulative normal distribution (which expresses the 
random variable x in terms of the standardized variable t — (x — x)/a, and 
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(x)j . 1 , 1 I 1 

04 06 08 1 0 12 14 16 


12>7 Generation of random variables from the cumulative probability distribution, 
(a) An arbitrary probability distribution in cumulative form. Values of the variable are 
shown on the scale below (a). To generate a value of this variable from the given 
probability distribution, a random number is selected from the random-number table, 
such as Table 12.2. Moving to the right, this random number selects a point on the 
cumulative curve. The corresponding value of the variable is found directly below 
(the random number 35 corresponds to the variable value 46 In this case). The stan- 
dard cumulative normal probability distribution offers a special case of Interest (b). 
The use of the standard curve, normafi 2 ed by use of the standard normal deviation 
l ~ (x~x)/ir, permits the same curve to be used for any normal curve by specifica- 
tion of x and «r. This Is Illustrated by the second scale below (b), which is for 

* = 1 0, <r = 0.2. 


therefore has an average of x = 0 and <r =* 1), we can generate a series of 
standardized values that may be used in different applications by the con' 
version x ~ x + t ff . For example, the standard scale may be provided, as 
in Figure 12-7b, and underneath it a series of graphic scales may be pro- 
vided for graphic transformation of the random normal deviation t into a 
specific random variable through knowledge of x and a for that case. Figure 
12-7b shows a scale that would work for the job-shop simulation if the 
move time had an average of one hour, a standard deviation of 0.2 hours, 
and a normal distribution. 

The usefulness of the normal distribution in simulation and the fre- 
quency of use of the above random generation process suggest that tabula- 
tions of random normal numbers be made up, as shown in Table 12.3. This 
table shows the result of transforming a table of random numbers to random 
normal deviation (i) by careful mathematical application of the procedure 
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Table 12,3 Random Normal Numbers* 


n — 0 , » = 1 



(1) 

(2) 

(3) 

(4) 

(5) 

(8) 

(7) 

1 

0.464 

0.137 

2.455 

— 0.323 

—0.068 

0.296 

—0 288 

2 

0.060 

—2 520 

—0.531 

— 1.940 

0543 

—1.558 

0.187 

3 

1.486 

—0.354 

—0 634 

0 697 

0.920 

1.375 

0.785 

4 

1022 

—0472 

1.279 

3.521 

0 571 

—1.851 

0.194 

5 

1.394 

—0555 

0.040 

0021 

2.945 

1.974 

-0258 

6 

0.900 

— 0.513 

—0525 

0.595 

0881 

—0.934 

1.579 

7 

1.179 

— 1.055 

0 007 

0.769 

0 971 

0.712 

1.090 

8 

—1.501 

— 0.488 

—0.162 

—0.138 

1.033 

0.203 

0.448 

9 

—0690 

0.750 

—1.018 

—0.445 

—0.511 

— 2.051 

-0457 

10 

1.372 

0.225 

0.378 

0.761 

0.181 

— 0.736 

0.960 


-0482 

-1.370 

- 1.010 


1.677 
-0.150 
0 598 


-0.057 

1.350 

-0.918 


-1.229 
-0 561 
1.598 


-0.480 

-0.250 

0.005 


2 273 
0.041 
—1.132 


0.600 
-0.307 
—2 098 


0.747 

0790 

0.145 


0850 

0.212 


14 

-0 005 

—0899 

0.012 

-0.725 

L147 

—0.121 

15 

1.393 

—1.183 

-0.911 

1031 

— 0.199 

—0.246 

10 

— 1.787 

—0.201 

1037 

1.040 

— 0.508 

—1.630 

17 

— 0 105 

—0057 

— 1084 

0360 

-0.992 

—0.116 

18 

— 1.339 

1027 

— 0.959 

0.424 

0 969 

—1.141 

19 

1.041 

0.535 

0.731 

1077 

0 983 

—1.330 

20 

0.279 

-2 050 

0.717 

—0.873 

—1.090 

—1090 

21 

— 1.805 

-2 008 

-1.633 

0.542 

0050 

0.160 

22 

— 1.186 

1.180 

1.114 

0882 

1065 

—0002 

23 

0 658 

—1.141 

1.151 

-1O10 

—0 927 

0.425 

24 

—0.439 

0058 

—1.939 

0.891 

—0.227 

0 602 

25 

1398 

—0.230 

0085 

—0849 

-0577 

0 237 

26 

27 

0199 

0 159 

0.208 
n o-7i 

—1.083 

—0.219 

-0091 

1021 


— 2.828 
0.247 
— 0.584 
0 440 


-0.439 
1.291 
0.541 
-2 661 


31 

32 

33 

34 

35 

30 

37 

38 

39 

40 


0 768 
0.375 
— 0.513 
0.292 
1020 

— 1.334 
— 0 287 
0.101 
— 11146 
1.250 


0079 
— 1 658 
— 0.344 
— 0.521 
2 990 

1.278 
— 0.144 
— 0 880 
0193 
— 0 199 


—1.473 
—0 851 
0.210 
1.268 
-0574 

— 0508 
— 0254 
—0921 
— 1.202 
— 0.288 


0 034 
0034 
-0.736 
— 1006 
— 0.491 

— 0.109 
0.574 
— 0509 
0.394 
1.810 


—2 127 
— 0 056 
1041 
-0.899 
—1.114 

— 02515 
— 0 451 
1.410 
—1.045 
1.378 


0 065 
0.340 
0 008 
0.110 
1.297 

—0.560 
—1.181 
—0.518 
0 843 
0 584 


—0.491 
0.219 
— 0.169 
— 0.098 
1039 

— 0.140 
— 1.698 
—1.041 
1.020 
1.047 

0.032 

0.151 

0.290 

0.973 

—0.289 

1.119 
—0.792 
0063 
0484 
1.045 
0 084 
— 0 086 
0.427 
—0.528 
—1.433 


2 923 
—1.190 
0192 
0.942 
1.216 


its present fm^n < c\v ChLcWn *1L L. °A l ^ ,e „ RAKD Cor P or ation. and appeared 
Operations Research, Wiley, New York. E J . L ; Production to 

is also reproduced by permission. ’ P ' ^ rom w b , eh the following example 
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described. (The random numbers used for this were not those of Table 12,2, 
but a similar set; therefore, do not look for a direct transformation between 
Tables 12,2 and 12.3.) 

Example: Two parts, A and B, have lifetimes that are normally dis- 
tributed, with the averages and standard deviations shown below: 


Part Average Lifetime Standard Deviation 

A 100 hours 20 hours 

B 90 hours 10 hours 

That is, our history about parts A and B gives us the experimental evi- 
dence summarized in these figures. Suppose an assembly were made of 
one part A and one part B , so put together that both A and B must work 
if the assembly is to work. What is the estimated lifetime of the 
assembly? 

Using Table 12.3 to answer this question, we start (again only for 
consistency of illustration ) in the upper left-hand comer and simulate 
ten assemblies by obtaining ten random numbers for each part. We 
convert these standard deviations into appropriate random variables 
using our knowledge of the average and standard deviation for each 
part. Because the assembly will fail if either part fails, the hfetune of 
the assembly will be the shorter of the two lifetimes so generated. Ihe 
results generated are summarized in Figure 12-8. To estimate tie average 
lifetime of the assembly, we average the figures in column (5), Figure 
12-8, to get 82.6. (Note that, as we would suspect, this value is less than 
the average life of either part.) We could also compute the standard 
deviation of the figures shown in column (5) to estimate the error due to 
sampling, and to make sample size calculations rf greater accuracy were 
re (l uired & In addition to showing the use of random normal numbers, 
example also illustrates how logical considerations may be aombtaed wtth 
probabilistic simulation, as did the Russian Roulette example of Frpme 
12-6. In general, complicated decision rules may be employed m a snnu- 
lation to 6 screen out combinations generated by the random number 
process. 


Some Computer 1"“ many other 
LSsTaS"— space, so that the desired result is often 

^^In^andom number generation, ^-“lled pseudo-randt^^numbers book 

is interesting to see how 

the process might be carried out. 
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Life of 

assembly 


Part A p * rt B 


Case No. 

U) 

Random 

normal 

number 

12) 

Rife of part 
= 100 + 20(1) 

{3> 

Random 

normal 

number 

(4) 

Life of part 
s= 90+ 10(2) 

(5) t 

Smallest value 
appearing 
in (2) or (4) 

1 

0464 

109.28 

0.137 

91J7 

91J7 

2 

0 060 

101.20 

-2.526 

64.74 

64.74 

3 

1.458 

129.72 

-0.354 

86.48 

88.40 

4 

1.022 

120.44 

-0 472 

85.28 

85.28 

5 

1.394 

127.88 

-0555 

84.45 

84.45 

6 

0906 

118.12 

— 0 513 

84 87 

84.87 

T 

\.\TO 

U358 

-1.055 

79.45 

79.45 

8 

—1.501 

69.98 

—0488 

85.12 

69 98 

9 

—0690 

88.20 

0.756 

97.56 

88.20 

10 

1.372 

127.44 

0.225 

92.25 

TOTAL 

AVERAGE 

92.25 

828.28 

82.63 


12-8 Simulation ot assembly lifetime using random normal numbers. Two parts, A 
and B, must both work If the assembly of A and 8 is to work. The lifetime 0 1 the 
assembly ts thus the shortest ot the lifetimes ot A or 8- tn this example ten assemblies 
are simulated using random norma! numbers. The estimate of the average lifetime 
ot the assembly Is obtained by averaging column (5) for the ten cases. The error of 
the estimate may also be computed using the figures of column (5). Average assembly 
lifetime obtained from this simulation Is 82.6 hours. (Data adapted from C. W. 
Churchman, R. L. Ackoff, and E. L Arnoff, Eds., Introduction to Operations Research, 
Wiley, New Vork, 1957. p. 182.) 


The midsquare method illustrates a typical approach. With this proce- 
dure a starting number b t is selected. For illustration, assume a four-digit 
number. This number is squared, producing a maximum of eight digits, The 
middle four digits of this eight-digit number are then extracted to produce 
a second four-digit number. The process is then repeated to generate suc- 
cessive four-digit numbers. These numbers will appear to be “random” to a 
large variety of tests often used for randomness (average, standard devia- 
tion, uniform probability of digits, run tests, and the like), and may be 
considered random numbers for simulation purposes. Thus, if b , = 1097, 
then bj* = 01203409. Extraction of the middle four digits gives b* — 
2304, and the process is continued. Although procedures of this type will 
eventually produce a repetition of the series (e.g., if 1097 should ever come 
up again in the course of number generation), nonrepeating strings of 
sufficient length for many simulation problems may be generated by careful 
choice of the starting number. More complex procedures may also be used 
to generate extensive random strings. 1 Random normal numbers may also 
be generated in this way by computing the sum of N random numbers. 
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which by virtue of the Central Limit Theorem in Statistics will be normally 
distributed. The normally distributed sum is then transformed to the stan- 
dardized form with x — 0, «r = 1. 

Finally, we should note that most computer centers have subroutines 
available for random number and random normal number generation, so 
that in a computer implementation of a given simulation these results may 
be used as blocks, and called as required in the construction of a computer 
program. 


•Computer Simulation Languages 

Because of the detailed discussion required and the availability of alternate 
references, we do not intend here to evaluate or discuss computer lan* 
guages. 8 

However, to make the reader aware of the major features of these impor* 
tant aids to large-scale system simulation, we will outline current practice. 
(We will assume in this section that the reader is familiar with introductory 

computer terms; if not this section may be skipped.) 

Today computer programers commonly work with so-called mgher- 
level” computer languages (FORTRAN, ALGOL, COBOL, AUTOCODER) 
rather than with more detailed “machine language" (so that the machine 
may be instructed more simply and with less programer effort for general 
problems in science and business). The systems analyst, when designing a 
computer systems simulation, will also use a compiler. Moreover, because 
a system simulation of any complexity often involves an extensive list o 
computer statements, many of which may be repetitive, a compi er ap 
proach with many subroutines is desirable. The reader will recall that 
computer languages, such as FORTRAN. ALGOL. ' an . .. 

CODER, correspond roughly to an encyclopedia A single statement in th 
"higher-lever language usually generates, by looking up and comp i ig 
more detailed insertions required in the proper order; . _ much longer hrf 
of detailed statements. For example, the statement MULTIPLY RATE 
- - series or instructions, nrst 


TIMES HOURS TO GIVE WAGE” may require 


liMas nuuna iuvul — * — ■* , * . hotirq m in 

to detennine the value of the RATE and the number of HOURS to use in 

, M „„,_n v re oIaced with more involved routines to 

The simple midsquare method isu Hy procedures are stored as com- 

obtam longer cycles of pseudo-random Dobell, "Random Number 

puter subroutines for use as needed Seel. b nuua 

Generators/ SIAM Review, 4, , Ju y> » {}ie following general references are of 
*In addition to the speaBc l ^Burdick, and K. Chu, Computer Simulation 

interest; T. H. Naylor, J. G. £3im ry. , f M j kn am j R. F. Gonzalez, Systems Analy- 
Techniques , W.ley, New York ^^ion MoS Imin, Inc., Homewood, IU. 1905. 
Sis- A Computer Approach to ^ {£(n Aca< Jermc Press, New York, 1965. II. S. 
D. N. Chora#*, ^f^uj£ Q S ~rh e Fast, Present, and Future of Simulation Lan- 
Krasnow ana n A. • . xafU 

guages/' Management Science, v , 
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the computation, and then to perform the repetetive addition, overflow, 
and other chechs necessary for the computer to MULTIPLY, and then store 
the answer in WAGE. When a compiler routine is used with the higher- 
level language, the computer itself does this look-up, translation, and com- 
pilation, thus saving the programer the job of doing the same. 

Special compilers and languages for simulation problems are used, 
because often the operations needed are not part of the usual general 
purpose compiler, although, of course with more difficulty, the general pur- 
pose compiler could be used* Some of the presently available simula- 
tion languages, which we shall discuss presently, exploit the advantages 
of general purpose compilers, such as FORTRAN, but add to them many 
additional features, such as timing routines, random number generators, 
output report generators, matrix manipulation, and notationai ability parti- 
cularly suited to simulation. Often this added ability requires an additional 
translation, or compilation. For example, the simulation language known 
as SIMSCRIPT permits instructions to be written in its own language or 
in its own language intermixed with FORTRAN. These statements are then 
compiled into a more extensive list of all-FORTRAN statements, and then 
for a particular computer to be used foT computation yet additional com- 
pilation and translation is required. For the IBM 7090/94 machine, the 
FORTRAN statements would be compiled into a move extensive list of 
symbolic statements and finally into the binary codes the computer can 
use. Although the program was originally written in SIMSCRIPT, it would 
be executed in binary machine language. These compilations are relatively 
complicated, but from the user’s point of view they are not burdensome, 
because they are performed by the machine. 

The three simulation languages we will discuss are DYNAMO, GPSS-III, 
and SIMSCRIPT. The compiler tapes required for these simulation languages 
are widely available, as are the detailed instructions for their use. 

DYNAMO*® is the name given to a computer language designed for the 
simulation of systems most easily described by flows with lags and rates of 
change of flow- In mathematical form, such systems might be described by 
differential equations, or equivalent difference equations that approximate 
the continuous case. A number of decision rules may also be used. An illus- 
tration of such a system is provided by the over-all statistics of a manufac- 
turing concern, such as total dollar sales, total dollar inventory, total dollar 
receivables, and the like. In such gross statistics we are mainly interested 
in the major relations that exist in cash flow, material flow, and the transla- 


tor example, for use of FORTRAN fa simulation programs, see McMillan and 
Gonzales, op. eft. 

-Jay Fok«*«. Dynamics, MJ.T . Press, Cambridge, Mass., 1961. The author 

describes the suitable types of problems, the method of simulation using DYNAMO, 
and prides numerous lllmtrabon,. A. L. Pugh, DYNAMO User', Manuel. 2nd ed., 
MJ.T. press. Cambridge, Mass , 1963. 
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tion of one gross measure into another— without the detailed definition of 
specific internal operations and the computation and display of detailed 
statistics. For example, as a simple case, if orders received take two weeks 
to produce and ship, and collection lags another six weeks, and raw material 
inventory must represent a two-week supply, how will the financial picture 
of the finn vary if sales have been steady at $10,000 per week for 20 weeks 
and then begin to increase at a rate of $1000 per week for the next 20 weeks? 
How will receivables and delays due to raw material shortages increase? 

In DYNAMO, a description of the problem is written in a special nota- 
tion that facilitates this type of problem formulation. In addition, the lan- 
guage has a special report generator that will plot, on a standard computer 
printer, selected variables as a function of time, i.e., time series, for analysis. 
These abilities allow the investigator to study the “dynamics of the total 
system by an analysis of the trends and patterns in the time series so 


pictured. 

DYNAMO can also handle variations in delay and feedback processes, 
such as computed production decision-rules based on prior system per- 
formance. In short, DYNAMO is congenial to the analyst who is familiar 
with the literature of servomechanisms and the design of automatic control 
devices, and who thinks naturally in the language and methodology of that 
field. In many industrial situations, simulation of manufacturing, distribu- 
tion, marketing, and financial problems may be facilitated by this approach, 
as Forrester indicates in his widely-quoted book. 

GPSS-III (which stands for General Purpose System Simulator, Im- 
proved Edition) and SIMSCRIPT are, by contrast, designed for more de- 
tailed simulation of specific operations and hardware systems. Both derive 
motivation and technique from earlier attempts to create a general purpose 
simulation language that would handle a large number of ,ob-shop type 
problems in which many changes in flow attributes or ronUng must be 
handled, often in combination with extensive analysis of waiting hues or 
queues. The two languages differ markedly, however, m their approach to 

SyS CPst-m lTan IBM compiler 1 ' that approaches the definition of the 
system to be simulated by providing for the analyses 

“blocks” representing common operations to be simulated ■ These compo- 
nents" may be put together in block diagram /omi (as descnbed in . C “f lc / 
nems may ot, pui i ± system to be simulated. From this block 

2) to form a bloc instructions may be coded relatively easily, 

diagram, the required lompum r , instruct .on^ ^ of b]ocJ; 

types available with* GPSS-III permits the rapid combinatorial construction 


General Purpose Systems Simulator 111 { #B20-0001-0), 
"IBM Application Program- ^ ^ Plains, New York. Also users manual 

IBM Data Processing Division, 

#H2O-OI03. 
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of a wide variety of system types. This speed of simulation is a definite 
advantage, as is the fact that the block diagram constructed for this com- 
piler bears close physical resemblance to the block diagram the analyst 
might use if he created a noncomputer block diagram based on physical 
observation and described the operations to be performed in ordinary Eng- 
lish. In some cases, however, the blocks provided for construction of 
GPSS-II programs will not accomplish what the analyst wants, or will do 
so only in a cumbersome way. 

The SIMSCRIPT language 1 * attempts to provide the most general form 
of simulation compiler, and uses the listing approach to definition, as pre- 
viously described in Chapter 4. By defining the entities (temporary or 
permanent) that make up a system as a hierarchy of categories, together 
with the attributes and set membership or ownership of each, SIMSCRIPT 
provides a general method for describing system Status at a given time. The 
extensive timing routine and event-notice listing procedures required with 
this form of definition form part of the compiler package, so that the analyst 
is relieved of the burden of their construction and maintenance. This com- 
piler has many special features that make for economy in the use of com- 
puter memory, provide extensive tabular report generation and statistical 
computation with a few instructions, and allow many system parameters to 
be specified at the time a given simulation run is executed (thus eliminating 
recompilation of the program each time a variation in experimental condi- 
tions is desired). These features, its FORTRAN compatibility and its wide 
availability, probably make SIMSCRIPT the most popular general purpose 
simulation language available today. 

The beginner, however, may be somewhat overawed by the multitude of 
special features available with this compiler, and may also have difficulty 
with the special conditions required to create the specific format used in de- 
fining a system with SIMSCRIPT. For example, SIMSCRIPT requires the 
definition of some entities and events of different types that do not at first 
appear to be related to the physical characteristics of a given system. Yet, 
these details are usually required for record-keeping, timing, and statistical 
summary, and provide much of the power SIMSCRIPT has for simulation 
of complex systems. 


For a ■ further rtanmron of simulation languages, the reader b referred 
to the references Med. It „ reasonable to assume that proficiency in the nse 
of any srurnktron language (or any form of computer programing) comes 
largely truth , erpenence u. developing and running specific probkms on a 
speoficmachme-as well as from detailed study of user's manuals. Yet the 


"H, M. Markowitz, B. Hauser, and H. W cruerm,™ . „ 

ming Language, A RAND Corporation Pubktahn ifS 5 Rl ^ T ' £ Simulation Trograi 
N.J., 1963. This is a user’s manual Sdhia 1™, Pr ?'i.’ ce '** a ^ Englewood Clii 

•to die meeffle manuals prepared f„ , g > S 
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effort expended by the analyst (say 40 hours) to acquire introductory 
skills with a given simulation language is greatly rewarded by its possible 
applications. Even though the analyst does not intend to enter into the 
details of computer programing, a knowledge of what can be done and 
the form of definition required will enable him to convey his problem more 
sensibly to a professional programer. What we have learned about the 
definition and analysis of systems will be of major benefit to further study 
and cooperation with computer specialists. 

We must always remember that simulation languages and the use of 
machines to mechanize a system simulation are not substitutes for the 
analyst s understanding of his system. A computer simulation is no substitute 
for a bad system definition or a poorly formulated problem. Rather, such 
fundamental confusion is greatly exaggerated by the mechanics of com- 
puter simulation, which should be reserved for detailed experimentation 
with an already sound and clearly understood system definition. 

A Blackboard Simulation Exercise 

To illustrate a verification that is desirable before a simulation is mech- 
anized, let us return to the job-shop example of Table 12-1. In working out 
the completion times for each job using the time-position chart, many of 
the detailed steps required were worked out by inspection, or by a few 
computations on a scratch pad. Although some care had to be exercised, 
particularly with the priority of work assignment at a given machine, no 
detailed statistical records were required. 

Let us repeat that example and perform the same simulation, but let us 
now set up the procedure to follow more nearly the detailed steps that might 
be performed by a computer. Our organization of work will approximate a 
SIMSCRIPT procedure, but will forgo the notational detail employed by 
that language. 

To start we wffl draw up a shop status board, Figure 12-9a, which shows 
in complete detail the organization of the job shop. The shop consists of 
four machines, so we make up a major column for each machine. We may 
also call the Move step an operation (essentially equivalent to Machine 5, 
if we think of material handling equipment) and make up another column 
for it. The last major column is called Finish, to represent the completion 
operation (essentially equivalent to a storage operation), and so may be 
thought of as Machine 6. The six major columns now define the six perma- 
nent divisions, or entities, comprising our shop. 

Some consideration will reveal that we will need to make three additional 
distinctions at Machines 1 through 4, namely, whether a Job is waiting in 
a backlog Queue, being Set up, or Running. For this reason, we provide 
three subcolumns under each Machine 1 through 4. 




12-9 (a) Table defining system hierarchy lor simulation, (b) Job ticket required (or 
routing and timing, (c) Event List required to anticipate next event (d) Historical record 
ot what has happened {finish times only are shown). 
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Next, thinking ahead as to how the status of the shop may look at any 
time, we realize that a backlog or a Queue at any machine could consist of 
several jobs (or none), but that at any time only one order (or none) may 
be Sef up or Hun, Therefore, we provide several rows in each Queue column, 
but only one row under Set up and Run. For a similar reason, we provide 
several rows under Move and Finish. 


We may now think of our Shop Status Board as being similar to that 
used by many shop dispatchers in controlling a shop manually. In such 
applications, each small square shown in Figure 12-9 is a small pigeonhole 
or pocket into which a record about a given job may be inserted, the posi- 
tion of the job card telling where it is, and the job-card description provid- 
ing the detail required to identify a job, tell where it has been, where it 
is going, and the processing time required at each step. We will use this 
Status Board as a picture of the shop, which will of course change as time 
goes on. (Such a board may be physically constructed on paper, out of 
metal, or drawn on a blackboard). In the computer the pigeonholes would 
he memory locations. 

Because there are four jobs to be completed, we will need four job 
tickets to start, as shown for Job A in Figure 12-9b. These job tickets corre- 
spond to the routing sheets or cards often found physically attached to 
in-process work in a shop. We will use them to indicate movements in status 
and to compute operation, move, and completion times. Note that the job 
ticket consists of two parts: An identification line giving the job name and 
amount, and a lower section which provides the routing required (progress 
tag sequentially downward on the card) and the setup and running tunes 
on each successive machine. In the definition line, a square a e e ar 
has been included. At the start of the simulation, when the |ob is first 
assigned, we will refer to the job as Job A, Part 1. Remem er, owever, a 
the Inspection Operation will reject a portion of the jo or rewor ' . 

*is is done the job is split into two parts, one of which will go tc 'finish 
and one to rework. With this job split we need a double identifica n 
record-keeping, and will therefore refer to the rejected portion of the job 
as Job A, Part 2. The same rule will hold for the °*er lohs_ (Dunng the 
simulation there may be more than four job tickets in the shop. The new 
tickets will he created as may be necessary.) , . 

The second column on the job ticket indicates the numbe of hours for 
setup at each machine. The third column gives 

per unit of product. Machine 4, the furnace, is a hateh op ' " d 

Data for the job ticket comes from the specifications previously presented 

in Table 12.1. r me shoo category to another, we 

To control the time of movements fl0m jf. llc Although a sample 

will need a Next Event List, as shown in Figure U lie k 

line is shown, this Next Event List will be clean at the start ot 
lation. 
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Similarly, to compile the statistics required for a display of completion 
times, we need a Statistical Record on which we will record the required 
data, which otherwise might be lost during the course of the simulation. 

With these forms in hand, the mechanics required for record-keeping 
and for reporting systems status have been defined. However, the rules for 
simulating the movement of orders through the shop remain to be clarified, 
and a detailed statement of what is to be done in the exact sequence that 
is to be followed, must be developed. As we shall see, this is no simple 
matter. 


Examination of the graphic solution already completed reveals that a 
number of different events can occur. For example, a given job can arrive 
at a given machine and join a queue. A job in a queue can leave it. Setup 
can start and be completed. A run can start and be completed. A move can 
be started, or completed, and so on. Each start or completion of an action, 
regardless of how minor the apparent action may appear to be, causes a 
change in system status, by definition. Moreover, the sequence with which 
these events takes place is dependent upon what is going on elsewhere in 
the system. What happens when two orders arrive at a machine at the 
same instant with equal or different priorities? 

The problems of defining all the rules of the operation would be less 
serious if only one specific problem were to be run. For example, knowing 
the job-shop problem as originally posed, and hasing the sequence of 
system status pictured in Figure 12-4, we see that, for the input used, queu- 
ing appears only at Machine 2, and not at the other machines. But, in 
general queuing could occur at any of the machines so that the rules de- 
veloped for the simulation must take into account all possible contingencies 
or a range o possible job inputs. This necessity to be exhaustively com- 
p ete m the specification of the simulated system presents a logical burden 
upon the analyst and usually requires much trial-and-error development in 
° f m f han i zed simulation. While making these specifications, 
k , eep h “ T 0 ft' specific process step under considera- 
tion and also on the over-all workings elsewhere in the system. 

lo 0 ktoaTAh UIm ' ra M * e de 'f m!Uired and ,he ,0 S ieaI development by 
Sit,™ JT T”? tha ' C ° Uld '“‘“8 them. (Often the 

influenced h ,h 6 Uc u ' mt to create in the simulation will be 

Mows we O, re “ rd 'f c P“g desired for statistical analysis. In what 

may be Z Cd Ma,yjis ° ! “tup. a " d times 

may be desired m addition to completion times 1 

ibg a d ; agra “ ° ! h0W ,he ««“»“ of events determin- 

(mla oa ^”nT^ a ,c W °“' d ^ “"trolled by the human analyst 
^Aboard S !f ' ” Bo " d “ d «he other forms of Figure 
may occu J from time tn !?. me s y stem definition. Different events 

connectors B through X JJld,ca ^ W the blocks labeled with the 

through X. The sequence of these events will change, however. 
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EVENT SETUP 
MACHINE (I) 


SCHEDULE RUN 
AT r=r PLUS 
SETUP TIME 
FOR THIS JOB 




EVENT RUN 
MACHINE (1) 

mu 


RECORD SETUP 
FINISH TIME THIS 
JOB. RECORD RUN 
START TIME. 

AND SCHEDULE 
COMPLETION ATT=T 
PLUS PROCESS 
TIME THIS JOB 
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depending upon the external input and the charactemt.c of the jstem. 
BecLe the simulation must proceed in a strictly sequent, al way w « 
strictly sequence the events that mil occur. A chain o{ events of thfa tyP 
1 L generated by the Next Event List. In effect .tins list select! lor 
switches our attention and effort to the next sequentially scheduled event 
and its associated manipulations and computations. As a result one or more 
future events will be scheduled on the Evcnl Notice List, which always wall 
return control, after completion of the immediate event execution and its 
associated routine, to the Next Event List and its associated control pro- 


ceaures. , . , .. 

Each event block in Figure 12-10a thus initiates a series of manipulations 
that are carried out when the event occurs. These manipulations separate 
blocks, or subroutines, in the simulation which the timing routine using the 
Event Notice List pieces together, as required-following exactly the proce- 
dure a human analyst would perform using the Status Hoard and associated 
forms of Figure 12-9. The fact that the simulation can be cut up into mod- 
ules, each of which may be developed separately, is a definite advantage 
in planning the simulation- Thus, although there may be a gre3t variety of 
specific and distinct events, each requiring the execution of a different com- 
putation, we can specify each of these computational modules or subrou- 
tines one at a time, just as we could specify for the human analyst proce- 
dures for each event on different pages of a manual. In this way the possi- 
ble complexity of a system can be segmented so that it is more easily handled 
by the analyst (or team of analysts). As we shall see momentarily, the seg- 
ments of procedure that follow a given event may be trivial or relatively 
difficult. However, the combinatorial ability of these segments to generate 
variety permits versatility in simulating a system. 

To be specific, as we view the jab-shop example, we see that work which 
arrives at Machines 1 and 2 will be handled in a straightforward manner. 
There will always be a setup (by reference to the specifications), there are 
no batching problems, and there is no problem of product rejection, as is 
the case with the inspection step. This suggests that one procedure be 
developed for Machines 1 and 2, and that a different procedure be required 
for Machines 3 and 4, and for other operations such as Move and Finish. 

First, consider the setup and run operations for Machines 1 and 2, as 
indicated in Figure 12-10b. These events, when the scheduled time has 
arrived, need only schedule new events. The completion of a setup always 
?™J“ a ran - a nm is scheduled on the Next Event 

List The completion of a run can result in a number of different actions, 
depending upon current system status, but this will be taken cate of at the 
proper tone if a completion „ otic e is posted and control returned to the 
Next Event List as shown. 


The procedural 

or 2, or when a job 


steps necessary when an arrival occurs at Machines 1 
is completed on Machines 1 or 2, are more interesting 
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EVENT ARRIVAL 
AT MACHINE (I) j 


/ARE ANY OTHER \ 

' JOSS schepuled X Y£ S 
I FOR ARRIVAL AT / 

YTHIS TIME? / / — 


ASSIGN JOB TO YES/ 
OUEDE FROM SOURCE / 1 
AND RECORO IN- ~*\ 
QUEUE TIME AND V> 

SOURCE-OUT TIME \ 


ASSIGN JOBS TO YES/ \ 

QUEUE FROM SOURCE ) / ,s SETUP \ 

AND RECORD IN- H*\ 0 R RUN ) 
QUEUE TIME AND XOCCUPIED/ 

SOURCE- OUT TIME / ' , / 
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or 2 * JZZ?' m ° b 4 V°"° Wed aRef the completion of a job at Machine 
° Jy!! !*° P eump,e After com P’ el,on - control again returns to the ne: 

event «,st Hot* that job completion will shorten the queue i! it is not empty 
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I are illustrated in block diagram form in Figures 12-11 and 12-12 re- 

f spectively. Some careful analysis is required in each of these cases to pick 

f U P all the logical possibilities that could occur, and to specify the appro- 

priate action to take for each possibility. 

For example, in Figure 12-11, we may think of the various possibilities 
that can occur. There will be a different situation if 

1. More than one arrival comes in at a given instant (or not), 

2. The queue is empty (or not), and 

3. Setup or Run is occupied ( or not ) . » 

Because these are yes-no questions, and there are three statements, our 
knowledge of Chapter 11 indicates that there should be 2 a distinct possibili- 
ties to take care of. However, we may note that the three statements are 
not logically independent: If the queue is not empty, then setup or run must 
be occupied because we would naturally move the work along if the machine 
were idle. That there are only six distinct conditions may be seen from the 
complete logical basis for the three statements, in which the required Os 
for setup and run full are shown. 

Condition Combination 

1234 5678 

Other jobs arrived 0101 0101 

Queue empty 0 0 1 1 0 0 1 1 

Setup or run full 0000 1111 

Condition Combinations 1 and 2 are thus impossible and may be eliminated, 
resulting in six distinct combinations. Each of these six possibilities is taken 
care of by the decision trees of Figure 12-11. 

In Figure 12-12 a similar approach may be used. At the time of a job 
completion, we normally would move up to setup anything in the queue. 
However, at that exact instant of completion the Next Event List might 
contain one or more arrivals to occur at the same instant. Should this be 
true, some of the members of the present queue could be bumped by a 
higher priority amval(s); therefore, such possibilities must be taken into 
account, . , .. 

Noting as before that if the number of scheduled arrivals in the Next 
Event List is not greater than zero, the number can certainly not be greater 
than one, we can construct a logical basis of the possibilities, inserting zeros 
where required. 

Condition Combination 

1 2 3 4 5 6 7 8 

0101 0101 
0011 0011 


Arrivals > 0 
Queue empty 
More than one 
arrival 


0000 1111 
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Here Conditions 5 and 7 are impossible and may be eliminated, lea g 
distinctions. In addition. Condition 6 (arrivals greater than zero but qu 
not empty) provides one more irrelevant condition, because if the, quel 
is not empty one or more arrivals will join ibe queue, awl Condition 6 and 
may be grouped. The Eve distinctions that remain are taken care of by tl 
decision tree in Figure 12-12. . 

An examination of all foe logical possibilities at each stage offoesimula- 
tion development is useful to prevent omission of a test combination that 
should have been included. Such omission is a common error ro dehnmg the 
simulation steps. Although completeness is not so strictly demanded in 
manual simulation, because the analyst can devise special rules as needed 
during the course of the simulation, the computer does not have this im- 
provising ability and must he supplied with instructions for every con- 
tingency, fust as a legal contract hopes to cover all pertinent conditions 
before they obtain. 

Returning now to Figure 12-10, we see that to completely specify the job- 
shop simulation we will need “Event Routines” for operations on Machines 
3 and 4. The presently defined routines will not do, because Machine 3 (in- 
spection ) requires a special series of steps to reject 2 OX of the job, split foe 
fob in two parts, and route each separately from there on. Machine 4, which 
is a batch operation, requires special tests for foe number of batches to run 
for each fob, and a different processing time computation. Similarly, the 
Event Finish requires statistical record beeping and a test for completion 
of the simulation, and also the initial arrival of Jobs A, B, C, D at Machine 
1 must be scheduled. (Construction of these additional flow charts Is left as 
an exercise.) 

With the detailed instructions for each type of event routine in hand, 
say On different pages of a procedural manual, the combination of these 
routines may be tested manually. For example, we may repeat, in detail, 
the sequence of events illustrated in Figure 12-4. If the detailed specifica- 
tions are correct, the results should be the same as before. In this test proce- 
dure (which is also left as an exercise) the analyst will find it convenient 
to record each scheduled event in the Next Event List, and then cross out 
foat event as it is executed. This will provide a complete listing ol what has 
happened in case further analysis is required. 

Some remarks are now in order. First, it would he possible to introduce 
randomness in any of the "Schedule Event” computations by means of foe 
Monte Carlo procedure described in the previous section. In foe flow dia- 
grams shown ( Figures 12-lOb, 12-11, and 12-12) such randomness has not 
been included m the simulation. Next, to make foe Event Routines as 
general as possible, we note that it would be convenient if we had some 
intlcxmg ability in "tagging” fobs, selecting order attribute data for test, 

,h 8 lW> “- F ° r such indexing ability is required 

in the manual case as the jobs move from one machine to another in the 



Selecting Simulation Test Variations 403 


job-shop simulation, because each successive machine requires different 
setup and processing data (obtained by reference to the next line on the 
job ticket). Simulation languages, such as SIMSCRIPT, provide this index- 
ing ability (although we have not indicated the notational details in the 
flow chart illustrations of Figures 12-10 through 12-12). We should also note 
that many different Event Routines could be developed to accomplish the 
same ends. For example, it would be possible to handle the different 
machine types in two Event Routines, like Figures 12-11 and 12-12 for 
arrival and completion— suitably complicating each by adding additional 
tests for “Machine Type” and providing the alternate actions required for 
each. The choice of Event Routine organization is a matter of convenience, 
and depends largely upon the purposes for which the simulation has been 
designed. 


SELECTING SIMULATION TEST VARIATIONS 
Exercising the System 

To see if the simulation procedures designed will work for the entire range 
of possible input cases, it is common to “exercise the system” by developing 
a series of test inputs that will provide a diagnostic routine for each segment 
of the simulation program. As described in Chapter 10, if the system can 
be segmented into a number of independent divisions, each may be tested 
separately by developing special input cases, or changing the values of 
initial conditions at each division. In the j'ob-shop example, we would be 
particularly interested in whether or not the queuing and priority rules 
worked satisfactorily at each machine, and we could test this by feeding 
a series of j’obs with different priorities directly to a given machine, or to 
the generalized routine for j'ob arrivals at Machine (I). The other methods 
of Chapters 10 and 11 may also be used for this purpose. In preparing such 
test data, and test routines, it is, of course, essential that we know what 
should happen in response to each test. For example, if a job arrives at 
Machine A and a queue is present, then that job should join the queue and 
eventually work its way through setup and run at Machine A. Should this 
required result fail to occur, we know something is wrong, and the specific 
failure provides a symptom for diagnosis of the timing sequence or Event 
Routine construction. 

Having satisfied ourselves that the simulated system is logically correct, 
we may explore the system further. 

Validation of the System 

If our purpose is to simulate a system in the real world (rather than 
study the logical operations of the logically defined system set up for 
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mminulation) then it is necessary to confirm that the results of the Simula 
, predict what will happen in reality under the same assumptions. F 
1 . , « A ^^nV.f.nn times such as those 


:^:ifir^Uted system predicts completion times su£ as t 
. - p . . — validating result would be the observa 


displayed iu Figure 12*4, then one « 

tim of similar completion Umcs for the same input s«,uenco and imUa 
conditions in a real job shop. The more such eo^rmmg results the anal>st 
ran obtain for a variety of cases, the greater confidence he may have in the 

realism of his simulation model. . 

The analyst should not be dismayed, however, to find that in systems 
controlled by human operators, predicted and actual results may difter 
widely. This is often the case because in the real world shop the rules 
allocating and sequencing work are often followed inconsistently, or various 
improvisations and emergency conditions (machine breakdown, faulty 
material, incorrect machine settings, lost jobs and so on) may occur to 
disrupt the flow of work. In some cases the realism of the simulation model 
may be improved by further complicating it, i.e., by adding the major con- 
tingencies that occur in the real world, but which have been left out of the 
simulation model for simplicity. Where further complication is intolerable 
(due to lack of memory capacity, greatly increased running times, and 
replication requirements for statistical reliability, etc.), something may ypt 
be learned from the simulation model about what would occur under more 
ideal conditions of shop conformity to plan. 

It should also not be surprising that it is easier to validate a simulation 
model for hardware controlled, or highly logical, systems than it is for 
human controlled systems. Thus, automated systems, such as telephone 
networks, computer controlled machines and processes, and electromechani- 
cal systems provide logical structures that are amenable to the simulation 
approach. Hybrid simulation models, which combine both human operators 
and some computer or hardware processing (previously called Operational 
Simulation), offer another route to improved validity. In such hybrid simula- 
tions we assume that the human operators will respond in practice as they 
do at simulation time, which may or may not be true. 15 


Heuristic Selection o! Test Cases 


Because many possible test cases could be used (which would possihl 
be interesting) in manipulating a simulation model, and because logic; 
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screening of conditions may be difficult, experimental conditions are often 
selected by judgment. In such “heuristic” selection of test cases, the analyst 
picks out experimental conditions or experimental decision rules that seem 
interesting, or likely to work, or that have worked in the past, and seeks 
to evaluate such proposals using the simulation model. For example, in 
the job-shop example, the analyst may wish to compare the present priority 
rule (determined by the last machine a job came from) with a rule that is 
determined by (1) promise date, (2) shortest job in queue, (3) first-in first- 
out in queue, (a FIFO policy), (4) last-in first-out (a LIFO policy), (5) 
the length of the queue for the next machine in a job route, or (6) a com- 
puted rank, depending upon, say, the value of the job (i.e., dollars tied up 
in in-process inventory) and the probability of shortage of that item in 
finished inventory. An evaluation of these possibilities might then proceed 
for a wide variety of possible product-mix loads arriving at the shop, and 
the rule that optimized a stated criterion (such as maximum machine utiliza- 
tion, shortest total processing time, minimum dollars in-process, or a com- 
puted criterion specified in advance) could be selected. Thus, from the 
infinitely large number of possible priority rules, a set of reasonable ones 
is adopted for evaluation. 

Parametric Variation of Test Cases 

In many simulation problems a question of data sensitivity arises which 
is often answered by variation of the data inputs and a computation of the 
resulting effects upon system output. This procedure, often called "sensitivity 
analysis” or "parametric experimentation,” may proceed in a number of 
ways. For example, the variations proposed may he taken at random, in a 
systematic way, or they may be taken at the extremes of a range of interest. 

In the job-shop illustration, we may be interested in the effects of move 
time upon the sequential status of the shop and the resulting variation in 
;ob completion times. Thus, we may introduce a random move time having 
a given average, standard deviation, and distribution, and generate a series 
of variable move times using the Monte Carlo method. Or, we may systemat- 
ically increment and decrement the stated one-hour move time by quarter- 
hour intervals. Or, we may ask for an evaluation in which the move time 
was in error by 20 %, and concentrate only on those extremes. Should the 
results (completion times) not be greatly affected by such variations in the 
parameter move time, then we could conclude that this was not a critical 
parameter in the model, or vice versa. Again the number of parameters 
that can be so varied in any complex simulation soon becomes astronomical, 
so that some selection is required when picking combinations of test 
parameters. 

A similar scries of experimental variations could be planned for param- 
eters involving machine capacity, processing times, and the like. 



<0* System SfflwteHon 


Evaluation ct Alternate System Designs 

Another source of test cases is the availability of alternate proposals or 
system designs for the performance of a given tash, or the processing o 
given set of taputs. An obvious approach here is to write “ taroa " ! 
programs for each proposed design, and to test these alternate systems under 

identical input conditions- . . . „ . • ,1 

A less obvious but mote fruitful approach is to design the on S 1 ^ 1 
simulation model so that alternate design structures can be obtained by 
the specification of a limited number of control statements, which will, in 
effect, restructure the simulation model’s sequence of operation and thereby 
simulate the alternate systems, as required. If the class of design alternatives 
to be evaluated can be anticipated at the outset, much work can be saved 
by the use of the more general simulation model. For example, if the various 
operations that might be performed in alternate system designs are antici- 
pated, they may be described in subroutines and pieced together in the 
proper order by a generalized control routine, with control parameters to 
be selected at execution time, depending upon the alternate design to be 
evaluated. The selection of alternate designs remains a problem for the 
analyst, but their evaluation is greatly simplified by this approach. 


50ME COMMENTS IN CONCLUSION 

Although we have employed a job-shop example in this chapter to illustrate 
the timing problems in some highly conditional system simulations, what 
has been said applies also to the use of system simulators in general. By 
describing the range of possible simulation mechanics— from manual to 
highly automated-it was our intent to provide the reader with a feeling for 
the generality of the approach, and to illustrate that the automated simula- 
tion is an extension of the manual simulation which greatly facilitates evalua- 
tion and manipulation, but which also places increased burdens of precise 
definition and logical consistency and completeness upon the analyst. It was 
alto our intent to dwell on the thinking required in the development of 
a simulation model, rather tfun to become engrossed in the details of a 
giv«\ simulation language for a computer or the hardware that might be 
used to implement the logical model. Because most simulation difficulties 
spring from the definition and specification stage of model development 
and from inadequate planning for the desired results, our discussion com- 
plements the more detailed discussion o\ simulation languages and me- 
chanics of simulation available in the references cited. For example, after 
cWte - *' raiw >'■»« a Beater appreciate for the 
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We have also attempted to provide cautionary remarks about the use of 
system simulation, and to warn that, although it is a powerful and popular 
technique for systems analysis, simulation is not a universally suitable ap- 
proach to human training, the development of better decision rules, the 
selection of human operators or system designs, or the evaluation of all types 
of proposed systems. The assumptions, requirements, and stability condi- 
tions demanded by a simulation may not be present in a system to be 
simulated; the resources required may not be justified by the case at hand; 
the analyst may not know what sort of results to print out; there simply 
may not be time to set up a simulation and experiment. Eugene Burdick 
expressed these limitations dramatically in his novel, The 480, as Curver 
realizes his formerly effective political simulation is useless under a shift in 
convention strategy. 

Curver {eh physically ill. Three months ago, even a few days ago, it had all 
been neat and scientific and well thought out. He and Dev had been close to 
doing the impossible . . . running an unknown for the Republican nomination 
against a gang of millionaires. And they had done it silently, cheaply, scienti- 
fically, and— almost— successfully. Now, because of Kelly, they were going 
back to the messy, old-fashioned, haranguing days he had read about in books 
and winced. It was too late to program anything about this new situation 
through the 7094. They would be flying on intuition and snap judgments from 
senile old bastards like Lander." 

In his bag of tools, the systems analyst has simulation-even mechanized 
simulation— as a possibility. 15 But there are many other possibilities. And the 
analyst should not be led to believe that simulation is the only form of 
systems analysis. The experienced analyst does not make this, sometimes 
fatal, mistake. 


PROBLEMS 

12.1 Why would it be desirable to simulate the control and logic system of a 
new elevator installation or a new elevator control system design, before 
the units are constructed and installed? What savings could be made by 
using the simulation technique in this case? 

12.2 Again consider the coin-changing vending machine of Chapter 2, Program 
the logical diagram given in Chapter 2 for an electronic computer, so that 

"Eugene Burdick, The 480, McGraw-Hill, New York, 1964, p 259 

“For a bibliography, see M. Shubik, "Bibliography on Simulation, Caming, Artificial 
Intelligence and Allied Topics," ]. Am Statist. Assoc., 55, 104, December, I960. For 
current developments consult Computing Reviews and The Communications of the 
A C.M., both published by the Association for Computing Machinery, New York, and 
international Abstracts in Operations Research, distributed by the Operations Hescarch 
Society of America, Baltimore, hid. 
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the operation of the machine may be simulated. Test the logic of the 
sending machine diagram by mating op, or generating, a random sequent* 
of inputs of nictels, dimes, quarters, pennies, and slugs. Shorn as a printed 
output from this simulation the coin input and the machine output (product 
and/or coin) for each transaction completed, 

A. Are there any coin input combinations that lead to an incorrect action? 

B. In a problem of this type, is there any necessity of introducing a random 
input for test, or could an exhaustive list of all possible combinations at 
the input he used instead? 

C. After which test (random or exhaustive) would you feel most com- 
fortable. as the designer of this machine? Why? 


12.3 To illustrate the usefulness of random normal numbers in simulation prob- 
lems, work out the following simulation exercise. A hand simulation will be 
satisfactory. 

Three light bulbs are connected in series, so that all must light if any 
are to light The average lifetime (in hours) for each bulb differs, and is 
given with the standard deviation of each below. Assume that the bulb 
lifetimes are normally distributed with the parameters given. 

Bulb ABC 

Average lifetime t, hours 100 90 80 

Standard deviation r, hours 30 20 10 

As >ou simulate the three bulb lifetimes, find the life of the simulated 

assembly (which will be the smallest lifetime of A, B, or C.) 

A- Simulate at least 100 assemblies, and tabulate the lifetime simulated 
for each assembly. Plot a histogram of the assembly lifetimes. Observe 
liow the distribution indicated by the histogram becomes more regular as 
you progressively add results to the histogram. 

B After completion of the experiment described above, compute the 
average and stand deviation of assembly lifetimes, using the histogram 
data or the tabular results. You should get an average of 70. Why is this 
Twuli smaller than the average lifetime of any of the three bulbs? Why is 
the standard deviation greater than vary of ’die individual standard 
deviations? 


1-1 rteudent lonra. nho heed, a man company, i, a lard taskmaster. He has 

„f P ,7'°' o! U ’ Jones find* that the probability 
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ing the pay of those who stay with him for one quarter, thereby changing 
the probability distribution of their survival to the following quarter: 

1 Quarter 0.5 

2 Quarters or more 0.5 ™ ,BWro » 

Assuming that both new and old employees who quit are replaced with 
employees "off the street” how many persons will Jones hire over a three- 
year period? (Note: Set this up as a Monte Carlo Problem, and simulate 
two years experience, by quarters. Assume the quarters are independent 
Compare the results of a complete no-bonus scheme with a bonus scheme.) 


PROJECT 8 

Simulate a three-level supply system. Suppose there are 125 stores, representing 
Level 1, each receiving and placing an order for a given spare part (each month) 
with probability p = 0.2, and none with a probability q = 1 — p = 0.8. (Thus, 
total system demand at Level 1 is 125p = 25, and the standard deviation of the 
total demand is Npq = \/125 x 0.2 X 0.8 = 4.5. (From the above definitions the 
distribution of this total demand follows the binomial distribution.) The stores 
always order when demand arises, and they supply to customers at once or they 
back order, if stock is not immediately available. 

In addition, assume that each group of five stores orders from a jobber, so at 
Level 2 there are 25 jobbers. At the jobber level, suppose the store orders are 
filled as they come in (if stock is available, or backordered and filled next month, 
if not). The 25 jobbers order from a single supplier, Level 3, monthly in batches 
of eleven (an eleven-month supply) when jobber stock falls to two units or less. 
Assume the supplier has a very large stock on hand, so that jobber orders can 
always be filled at once. 

A. Simulate this supply system by hand or on a computer, and find the distribu- 
tion of demand at the supplier, Level 3. Initialize the jobber stock levels to start 
at eleven pieces each and store stock levels at one unit each. Simulate at least 
150 months, assuming the demand at the stores is, ol course, independent 

B. Comment on the size of the supplier’s standard deviation of demand. Would 
you expect it to be more or less than the 4.5 given by theory for the Level I 
demand? 

C. How would a change in jobber order size change the distribution of supplier 
demand? Would a change in initial conditions change the final distribution of 
supplier demand? 

D. Can you distinguish a transient and a steady-state result in this simulation? 
Should the transient data, if any, be used in accumulating data to describe the 
supplier demand distribution? 

E. In setting up the simulation, make provision for computing the average store 
and average jobber inventory levels and for recording the number of back-orders 
at the store and jobber level. What changes in initial conditions and ordenng 
rules would change the average inventory and backorder statistics? How do the 
ordering rules link the supply levels? 
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PROJECT 9 

Using the method of system definition used in the SIMSCRIPT computer language, 
as outlined in the last section of Chapter 4, and the Job-shop blackboard exercise 
illustrated in this chapter, simulate the job-shop example cm a computer. (As an 
alternate, use GPSS 111, or any other language you prefer.) The data for a 
deterministic problem are provided in Table 12.1. 

A. Compare the results for a deterministic simulation with the results shown in 
Figure 12-4. 

B. Introduce variability into the simulation at one or more points, and compare 
the results of the probabilistic case with the deterministic case. (A number of 
replications of the experiment will be required to obtain a distribution of output 
results.) Why will the output results vary more than you might expect from a 
casual inspection of this problem? 
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ANALYSIS FOR 
IMPLEMENTATION 


Most systems analysts find that they introduce change in the systems they 
study. They correct the faults of malfunctioning systems, or seek to improve 
an existing system. Or, they design new systems and seek to implement them, 
displacing older systems by new ones. Indeed, the systems analyst, probably 
has "the future in his bones,” to use C. P. Snows phrase. Otherwise, he 
would not have taken up his occupation. 

Often such changes greatly broaden the scope of the systems analyst's 
problem and call upon his knowledge not only of technical possibilities but 
also of institution and culture. 

When we consider this broader subject, discussions of the nature of man, 
a subject central to most of the great systems of human thought, eventually 
confront us. Based on assumptions or ideas, usually simplified to suit the 
times, the concepts of the rational man, the sinful man, the man of will 
and power, the economic man, and the Freudian man have all become the 
basis for arguments and theories of human behavior that are still with us 
in many forms. 

The latest trend in building a picture of human nature is to consider 
"behavioral man," who is defined by the sum total of his observed actions, 
rather than by assumptions about his character. For our present discussion, 
this latter viewpoint provides a more varied picture of the human scene 
and a number of detailed observational results that can be used by the 
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systems analyst. Because all operating systems at one or more °' 
Jhelr definition, development, analysis, and use are affected by huinan 
individuals, knowledge of actual belravror patterns m ^'entcm®* 
stances becomes an essential ingredient in understanding how systems come 
about and how they w»H be accepted and w ork in practice. 

In what follows we present a sketch of a major cultural conflict ana a 
corresponding discussion of behavioral science man that has meaning for 
the systems analyst who hopes to Implement system change . 1 


SOCIAL SCIENCE AHO TECHNOLOGICAL CHANGE 

Social scientists have long been concerned with the problems of technologi- 
cal change and haw they influence the society and the culture in which we 
live. The reverse problem, of course, is also important because the historical 
setting, and the tools and ideas presently known, affect the selection of 
projects considered worthwhile. 

In viewing this grand process of technological advancement and social 
change wc generally find a conflict between those who propose change and 
those who prefer the present state of affairs. In analyzing a given society, 
for example, it is useful to make a distinction between what Wheelis has 
called "the instrumental process" and “the institutional process.” These are 
concepts representing two opposing clusters of activities, attitudes, and 
kinds of criteria for what “good" is. Both processes affect groups within 
society at a given time, and indeed to a greater or lesser extent they affect 
the Individual at a given time. Thus, our understanding of these processes 
sheds much light on the ways in which system change may be acceptably 
introduced, so we shall consider each of them in turn. 

The Instrumental Process 


The instrumental process, « hich is concerned with “the facts," stresses 
replication, verifiability, and usefulness in social life. “The authority of the 
instrumental process is rational, deriving from its demonstrable usefulness 
to the hie process. Tin: final appeal is to the evidence .** 1 

The scientific method approximates the heart of the matter, hut the 
instrumental process u a larger concept that concerns the development of 
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tools and techniques (both physical and mental) used to solve problems. 
Electric saws and the differential calculus are both instruments for this 
purpose. It includes art, both fine and applied, because materials and 
methods are required in their completion. The instrumental process is bound 
to reality, facts are facts, it seems to say. Ignoring them is of no avail. 
Reality can be altered, particularly if it is clearly observed. Indeed, the 
better one understands it and the more tools one has to deal with it, the 
more radically it can be changed. 3 

We may not like the facts, but there they are— for better or for worse. 
Our job is to proceed with the project at hand. As we proceed with the 
instrumental process, we learn more and develop more tools, which, in turn, 
increase the output of the instrumental process and the number of possible 
combinations of what is already available. 

The instrumental process is respected because it is useful, productive, 
growing, and bountiful in many, though not all, spheres of human life. Yet, 
from an individual or social viewpoint, for very personal reasons, the instru- 
mental process "is often disparaged as mere problem-solving; for the security 
it creates, though real, is limited." 4 It may be respected, but not loved. 


The Institutional Process 

The institutional process, on the other hand, builds certainty, not doubt, 
for the individual. It seeks stability, sure-footedness, a rock of ages. Change, 
particularly rapid change, is shunned. 

Thus, both the individual and society build strong barriers, both con- 
scious and unconscious, for protection against change. Most frequently, 
these barriers seek an authority, organization, or tradition-in short an 
institution— larger than self or even everyday reality. This institutional 
process is diametrically opposed to what we have previously called the 
instrumental process. 

Everything mundane is subject to change, and hence certainty is not 
to be found in the affairs of men. The searcher arrives at his goal [of 
certainty], therefore, in a realm of being superordinate to man. Solomon 
put it succinctly: “Trust in the Lord with all thine heart; and lean not 
unto thine own understanding. 5 

Although religion is one example of the institutional process, the concept 
is broader than that. The institutional process includes customs, taboos, 
rites, mores, ceremonial compulsions, magic, kinship, status, coercive power 
systems, and such modem institutions as private property and the sovereign 

' Wheelis, op. cit., p. 75. 

* Wheelis, op. cit., p, 75. 

' Wheelis, op. cit,, p. 74. 
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state. As such. The authority of the institutional process is arbitrary; the 
final appeal is to force.” 

In particular, the institutional process is bound to human desire and 
fear. Wishing will make it so, it seems to say. It is unbearable that no 
one should care; so there must exist a heavenly Father who loves us. 
Activities of the institutional process do not, objectively, gratify any 
need or guard against any danger; incantation does not cause rain to 
fall or game to be plentiful. But such activities may engender a sub- 
jective sense of security, and this has always been a factor to be reckoned 
with-and, indeed, to be exploited. Honor and prestige accrue to the 
institutional process; for the security it creates, though illusory, is 
unlimited.* 


Some Findings about Behavioral Science Man 

The matter is put in a slightly different way by Berelson and Steiner, who, 
after compiling an inventory of scientific findings in the social sciences, 
summarize that body of knowledge on "behavioral science man.” T 


Perhaps the character of behavioral science man can best be grasped 
through his orientation to reality. . . . First, he is extremely good at 
adaptive bchavior-at doing or learning to do things that increase his 
chances for survival or for satisfaction. . . . But there is another way 
nun comes to terms with reality when it is inconsistent with his needs 
or preferences ... In bis quest for satisfaction, man is not just a seeker 
of truth, but of deceptions, of himself as well as others. . . . When man 
can come to grips with his needs by actually changing the environment, 
he does so. But when he cannot achieve such “realistic” satisfactions, he 
tends to take the other path: to modify what he secs to be the case, what 
he thinks he wants, what he thinks others want 
fin the latter case] he adjusts his social perception to fit not only 
the objective reality but also what suits his wishes and his needs . . . ; 
he tends to remember what fits his needs and expectations .... or what 
he thinks others will want to hear . . . ; he not only works for what he 
wants but wants what he has to work for . . . ; his need for psychological 
protection is so great that he has become expert in “defense mechanisms" 

. . . he will misinterpret rather than face up to an opposing set of facts 
or pomt ol view . . . ; he avoids the conflicts af issues and ideals when- 
ever he can by changing the people around him rather than his mind 

and when he cannot, private fantasies can lighten the load and 

carry im through ■ - - ; he thinks that his own organization ranks higher 


’ WWli , cm rft.p 73 
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than it actually does . . . ; and that his own group agrees with him 
more Fully than it does . . , ; and if it does not, he finds a way to escape 
to a less uncongenial world . . . . 8 

The introduction of change, particularly change that seems to the in- 
dividual beyond his control and which therefore threatens, or reduces per- 
ceived security, elicits reactions that are not necessarily logical and that 
the systems analyst may expect in the course of his work. If the systems 
analyst proposes change, which is the essence of the instrumental process, 
he meets the proponents of status quo, for the essence of the institutional 
process is to stand pat. 

Indeed, some institutional processes are so rigid, that the innovator con- 
tests them at his peril. In his Letters from Earth, Samuel Clemens mote, 

We do not know how or when it became custom for women to wear 
long hair, we only know in this country it is the custom, and that settles 
it ... . Women may shave their heads elsewhere, but here they must 
refrain or take the uncomfortable consequences .... The penalty may 
be unfair, unrighteous, illogical, and a cruelty; no matter, it will be 
inflicted, just the same.® 

And, some habits are so strongly ingrained that they are impervious to 
change. Many individuals work at night to avoid change. As one elderly 
worker, who had worked the night shift for forty years expressed it to the 
author: 

When I went to work there were no cars on the street, only horses. 
Wagons and horses. Now there are cars, and too many people. Too 
many. Why do 1 work at night? Things never change much at night In 
the morning there are cars and people. But, things never change much 
at night. Who needs those cars and people? 

It is interesting that the instrumental and institutional processes de- 
scribed above have been compared to the term ego and superego used in 
individual psychoanalysis. 

In those terms, the ego represents the executive department of the 
human personality — the instrumental activities that recognize facts, marshall 
resources, devise plans of action, and get things done. 

The superego is the judicial department, which performs a screening 
function, directs awareness, vetoes unacceptable proposals, and sets values 
and effectiveness measures used in goal formation. The superego is institu- 
tional in character, and derives its “conscience" from the culture, its cus- 
toms, habits, and mores. 


* Berelson and Steiner, op. cit., pp- 663-664 

* CJefliens, S„ Letters from Earth as quoted in Berelson and Steiner op. cit , p. 642. 
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Most studies stew that the conscience so tonned is highly relative 
culture or society in which the individual lives . 10 . a 

For many important issues, constraints upon the superego are s y 
small group-those near home as it were. Thus, the constraints are greater 
v.h«e families live together for several generations than where they do r>, 
greater in small towns than in large, greater at home than abroad, greater 
when there are strong religious beliefs or formal institutional ties than when 
there are none, greater when only one set of values is perceived tban 
when there are many that seem relative to tune or place. We are all familiar 
with acts and common phrases that illustrate these facts. 


Things happen in motels that do not happen in homes, and towels are 
swiped in distant hotels by persons who would not steal a pin in their 
hometowns. Some persons, indeed, travel for just this purpose— to lose 
3n unwanted reinforcement of conscience. For them wanderlust is not 
a lust for wandering but a wandering for lust." 


Similarly, methods exist for strengthening the superego or judicial func- 
tion in the personality (and therefore the institutional function in the cul- 
ture), and also for reducing its effect. Street lights reduce burglaries, and 
doubic-cnfry bookkeeping reduces embezzlement Conversely, in surround- 
ings where multiple values are evident and where institutional restrictions 
are consciously relaxed, the range of acceptable value and goal choices 
increases, leading to a greater variety of possible actions or considerations 
and to a greater potential for change. Thus, as institutional restrictions 
become less (and instrumental efforts are, relatively spealdng, more re- 
spected), new proposals become more easily accepted by those who must 
approtc and use them and vice versa.'* 

Note also that the capacity of the investigator or the individual to effect 
system change, or to alter his perception of the facts to suit his needs, is 
due to his capacity for the creation and manipulation of symbols. Language 
and abstraction form the concepts, ideas, and instructions that permit learn- 
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ing and the transfer of experience (and the very act of systems definition 
and analysis). 

This capacity, which permits the analyst to generalize, also permits him 
to change the name of the game to suit his needs. 

Although it would appear to be a fact that a rose by any other name 
would smell as sweet, for individuals who view a scene this invariance of 
description may not hold. A plain ham-and-cheese sandwich may taste 
better if it is described as a “wedge of cheddar wedded to a generous 
portion of prime Virginia ham surrounded by California tomatoes and 
Florida lettuce and a discrete portion of pure egg mayonnaise.” Even though 
a simple yes or no might suffice, a problem solution may seem more impres- 
sive and convincing if couched in mathematical symbols and presented with 
a slight but correct accent by a man with a Ph.D from Cambridge. 

But, because our symbols, our abstractions, and our ability to conjure 
up favorable or unfavorable impressions with a word or gesture are products 
of our culture, and because the participant-listener's symbols and abstrac- 
tions are formed by his culture, a knowledge of how symbols are formed 
and held is the key to our understanding of culture . 13 

Finally, we should note that institutional problems are magnified 
as the instrumental process advances with time. Although institutions 
do change under the impact of technology and instrumental advances, they 
change slowly and reluctantly, . . and make peace, finally, with the 
conditions which altered them .” 14 

But institutional change is slower than instrumental change. By the 
time the institutional process, or the culture, has made peace with instru- 
mental change, . . technology has moved on, and the laggard is still trail- 
ing.” 15 Indeed, the discrepancy between instrumental growth and institu- 
tional change becomes worse with time. The combinatorial possibilities of 
instrumental growth are geometric; more tools, more techniques, more 
facts, and more concepts lead to a cornucopia of new possibilities. Tbe 
instrumental process is regenerative, but the institutional process does not 
partake of this bounty and holds steadfast, unless intimidated, coerced, and 
pummeled into movement. 

The resulting effect is a cultural Jag-an ever increasing gulf between 
what is possible and what is acceptable. 1 * 


" A culture is most frequently defined by its community of understanding, and may be 
measured by the frequency of compatible interactions that occur between its members, 
as indicated by K. W. DeirKch, Nationalism and Social Communication, Technology 
Press and Wiley, New York, 1953. C. P. Snow expressed the same idea in literary 
terms: "Without thinking about it, they respond alike. That is what culture means." 
To speak of a cultural iag is perhaps not so descriptive as to speak of a cultural conflict 
or gap between the instrumental and institutional community of understanding. 

" Wheclis, op. cit , p. 82. 

” Wheehs, op. ctt., p. 82. , „ 

■« The culture lag was first discussed in length by W. F. Ogbum, Social Change, Dell, 
New York, 1927. 
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Introducing Change 

Changes that alter no dearly held belief, custom, or mode of habitual opera- 
bon are often unreduced with relative ease in highly technical areas. Tools 
may be redesigned, new production methods may be introduced new 
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new drug— and their smaller number makes them easier to convince with 
limited resources— then their approval can have a strong institutional 
influence in maldng the technological advance acceptable to the total list. 
Example: The elimination of elevator operators (by the substitution 
of push buttons) to reduce operating costs may well be spoken of as 
an attempt to improve passenger service, to increase passenger comfort, 
and to reduce delays in the lobby. The acceptable symbols are stressed, 
and the less acceptable image of a mercenary landlord throwing old 
retainers out of work is played down. 

Another benefit derives from these seemingly devious devices. Change 
is easier to introduce in form than in substance: When imposed "from the 
outside” the forced change may result in overt compliance, but covert 
resistance. If the proposed change can be made to come "from the inside” 
(or seem to), the form and the substance of change are more likely to 
coincide. The change is also more likely to be permanent and not to be a 
mere verbal acquiescence until the analyst has gone. 

Anthropologists also tell us that in the evolution of a culture pattern 
action comes first and values second. We begin to like what we do. Thus, it 
is frequently easier to change the values of individuals concerned with a 
system by alteration of their activities rather than by a direct attack upon 
their beliefs. 

Example: When a system’s users are unfamiliar with it they may be 
afraid of it, and thus hold it in low esteem. However, after experiencing 
success with its use they become more optimistic about their ability to 
influence their own future, and the values employed by the users in 
assessment of the system change. Consequently, instead of describing 
the virtues of a new product or system, a demonstration model is put 
in the user’s hands so that he may convince himself of its value. 

Similarly, if behavior can be changed, a change in values usually 
follows more easily than if the reverse procedure is followed. 

Example: These alternate approaches to change are evident in the 
policies of the Eisenhower and Johnson administrations with regard to 
racial discrimination. The Eisenhower policy was that social values 
would change with time, leading to later changes in behavior. The 
argument for this approach is that when the behavioral change does 
occur, it will be permanent because it is reinforced by the individuals 
value structure. The Johnson administration, on the other hand, in urging 
stronger legislation against discrimination, sought to change behavior 
from which a change in values would emerge. It is argued that the latter 
course produces permanent social change more swiftly than the former, 
although covert resistance may be expected at the outset. 
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When introducing change in la, gcscale systems and 
policies are much easier to change than procedures. This i ob mat «"• "> 
by many professional administrators and systems analysts does not c n 
S * ha, just been said, hut blends instrumental considerations 


'"'l-tat the detail and variety of specific changes that must be made m 
procedures to obtain a given result frequently exceed the resources ot the 
controller. And, because procedures usually adapt themselves to the de- 
mands of policy, much mare can he accomplished in a given time with 
limited resources by the policy-change approach, leaving resources avail- 
able for the adjustment of critical procedures when necessary. 

But, perhaps more important, a change in policy rather than in de- 
tailed procedure, leaves some freedom, although it may he illusory, for the 
individual to adjust to the proposed change. His values and institutional 
roadblocks are not directly confronted, and he is more likely to accept "his 
own” adjustments as good ones. 

For similar reasons, making a few large-scale policy changes is often 
easier and more effective than instituting many small changes. The few 
major changes can be controlled, and the individual docs not develop the 
frustrations, confusions, and value disturbances that a series of harassing 
minor alterations will produce. 


PROBLEMS OF IMPLEMENTATION 


Without going further, it would appear from what has been said that many 
repetitive difficulties arise when certain kinds of change arc proposed and 
introduced, and that from this experience much has been learned about 
social and cultural changes. Moreover, numerous techniques and artifices 
may be used in easing the introduction of instrumental change and in miti- 
gating institutional obstacles. Yet the field is an open one, and much remains 
to be done. 

If this is so, it would appear reasonable to conduct analyses aimed at 
the implementation of specific systems, or at specific systems improvements 
-so-called action research-wbenever system innovation and improvement 
are proposed. 

For example, the eflorts of political scientists and public relations firms 
to elect a given political candidate, to ease passage ol a school bond issue, 
to reduce racial discrimination, or to promote the fluoridation of drinking 
water represent research of this type in the social sciences. Market and 
advertising research also has the same "action' flavor. The same may be 
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said for the use of psychological tests and research in personnel selection 
for given occupations. The research is not “disinterested” in its outcome; it 
is purposeful and goal directed. 

Such activities, which we prefer to call analysis for implementation, are 
often shunned by the scientist (for reasons we shall mention hereafter), 
although, for a particular purpose and stated objective, analysis for imple- 
mentation may greatly increase the success of a technical advance, dis- 
covery', or improvement. 

For example, even relatively simple system changes may raise questions 
for which the uninitiated analyst is unprepared. The automobile owner hesi- 
tates to spend more funds on his car because his children are sick, so he 
believes the car will run another year— even though the mechanic knows 
objectively that the vehicle is potentially dangerous. What is the mechanic 
to do? The patient hesitates to have the clearly indicated operation and 
believes he will get better. Should the physician retire after stating the 
bald facts? Persons who have not experienced the disasters of a tornado, 
flood, or large explosion tend to deny or to disbelieve warnings that danger 
is near. They search for more information, and often “. . . interpret signs 
of danger as signs of familiar normal events until it is too late to take effec- 
tive precautions.” 18 Should the weather man report his scientific conclusions 
and make no interpretation or exhortation to his listeners to take care? A 
system that takes care of todays conditions is installed, but it does not 
provide for future contingencies or for “updating” the system as conditions 
change with time. Later the system efficiently performs functions no longer 
relevant to prevailing conditions, with possibly disasterous consequences. 
Should the analyst not concern himself with these problems of implementa- 
tion and use? 

Should the Analyst Concern Himself with the Problems of Cultural Change? 

There are many scientists, and thus many systems analysts, who would, 
on serious grounds, take issue with the proposal that the investigator con- 
cern himself with the use made of his work. Rational investigation and 
logical decision cannot take place, they argue, if one has a vested interest 
in obtaining a given outcome. This position is strongly held by science as 
an institution because of the historical struggle to free science, and the 
instrumental process, from the constraints, the dogma, and the myth-making 
of institutions. 

There is yet another reason for the strength of this position. As older 
institutional constraints have been removed, others have come into play 
as the individual searches for the universal certainty which the rational 
process cannot supply. 

“C. E. Fritz, "Disaster," in R. K. Merton and R. A. Nisbit (Eds.), CcmUmporory 
Social Problems, 1st cd., Harcourt. Brace & World. 1961, p. 665. 
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For example, a vote of the majority or a mass movement may threaten 
the dispassionate scientific pursuit just as much as constramts unposed by 
TJZch or the clergy. The mathematician does not amve at a problem 
solution by conducting a public opinion poll on the street and averapng 
the answers obtained, even if those polled are other mathemahcmnsl The 
test is different. It depends upon the demands of a verifiable procedure, 
not the beliefs of any individuals, regardless of their reputation, their power, 
their eminence, or their number. 1 * If in his choice of variables or the develop- 
ment of alternatives (both of which are subtle selection processes) the 
scientist consciously begins to favor or exclude one group of possibilities for 
institutional reasons, or if he must shade his thinking, model building, anu 
verification to meet institutional constraints, he has abandoned the basic 
tenets of science. Surely, he says, many new discoveries that might other- 
wise be his will be lost, or worse he will be deluded and falsely evaluate 


what he observes. 

The scientist may also argue that his energies are limited. To wony 
about implementation will not only debase his pursuit of knowledge but 
will also embroil him in a conflict of personal values that will consume his 
limited resources in a wasteful fashion. How can he simultaneously be a 
myth-maker and an iconoclast? 

The dilemma presented by arguments for and against research for 
implementation is well known, and although exaggerated here for effect it 
is nevertheless real. The scientist-analyst may choose to avoid the problems 
of implementation altogether, but someone will implement the results of 
his w'Ork, or not, and the job may be done well, or poorly, depending upon 
how well the analyst and the implementer understand each other.* 0 


Leadership and Organization to Bridge the Cultural Gap 

One resolution of this difficulty— which almost always arises in applications 

of management science and operations research, to name one area of systems 


For example, one test for an Instrumental versus an institutional process is whether a 
conclusion Is validated by the success of a procedure or the agreement of an individual 
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analysis— is to conduct a project with mixed teams of investigators, some of 
whom have instrumental skills, and others who have institutional skills. 
These skills when shared in the work group bridge the interface between 
the two worlds and permit a smoother transition between problem defini- 
tion, analysis, and implementation. 

Many organizations use similar devices to bridge the gap between in- 
strumental and institutional requirements. For example, often an organiza- 
tion has two leaders instead of one, although one man may be apparently 
in charge. The role played by one leader is to institute instrumental change. 
The role played by the other leader is to represent the institutional require- 
ments of the organization and the component individuals. When the instru- 
mental leader causes friction, the institutional leader smooths it over or 
rephrases the requirements in more acceptable, wanner, more congenial, 
or more orthodox terms. It matters little who is the obvious leader, so long 
as tlie two cooperate (and can stand each other) l Together they will be 
able to produce more change in the organization than either could accom- 
plish individually. The family unit, to come closer to home, offers a similar 
example of dual leader effectiveness in creating change in the habits of 
the young. 

A final example from intelligence operations is interesting because it 
illustrates the power of the same approach, employed to different ends. 

Example: Many of the intelligence services of the world employ a 
method of interrogation in which two interrogators, each employing a 
distinctly different role characterization, alternately confront the sub- 
ject. The first man, for example, may affect a stem air, a military costume 
with riding boots and crop, a crew cut, an air of efficiency and dispatch, 
and a stem, cruel disposition. As the perfect martinet, he demands the 
subject talk at once or be shot at dawn, withdraws all physical comfort, 
inflicts various apparent physical and mental tortures— just as the subject 
might anticipate. Finally, in exasperation, ordering the firing squad to 
be assembled, the first interrogator leaves the room. Very likely he has 
learned nothing, and expected to learn nothing, from the subject. After 
some time, the second interrogator enters. He is an entirely different 
type. Dressed in baggy tweeds, probably smoking a pipe, he seems di- 
stressed with the subject’s plight, orders some food to be brought at 
once, produces cigarettes, offers apologies for the abruptness of his 
colleague, and settles down to commiserate with his unfortunate friend. 

He may offer some personal experiences of his own, some philosophies 
and observations on life and his friends present plight. But then, he 
is unavoidably called from the room, and the subject is again alone. He 
did not expect this. As the night wears on, the alternate presences con- 
tinue. First he is threatened, then he is consoled. In the end, although 
some time may be required, the subject begins to change: He sees the 
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he talks, as was Mended, most likely to his tweedy fnend who in fa<* 

was the instrument of the subjects change.” 

Little more can be said here on this topic, hut the atlminlstratorof 
systems analysis projects who seeks to Mend instrumental »"d 
skills far a given study can benefit from further study of the dual role 
approach to organization. 


ORGANIZATION OF DATA FOR IMPLEMENTATION 

To continue, let us suppose the investigator is concerned with the problems 
of implementation, and that his concern is to anticipate problems that may 
occur in a specific case, rather than to worry in general. 

Many of the examples and points cited in our next few pages follow 
from underestimates of institutional power to affect routine and emergency 
operations of instrumental processes. 


Why are Implementation Data Scarce? 


In most specific cases, the selection of treatments and the anticipation of 
specific problems and reactions requires high technical competence and ex- 
perience in a given field. Nevertheless, in a given field, the data of past 
experience can be organized in an orderly fashion and “what if” questions 
can be ashed for specific analyses of implementation. For example, the 
collection of Symptom-cause relationships described in Chapter 11 might be 
a typical example for tins type of display. (Problem characteristics would 
then replace symptoms, and probable difficulties would replace causes.) 

However, in collecting data for implementation, the purpose of the 
collection is expanded. The emphasis is not on the apparent difficulties and 
present symptoms, but on those that might appear in the future, or that 
have been Vnown to occur in the past, given specified side conditions. 

In medicine, for example, certain treatments or drugs are known to be 
cwmterindicated when the patient has a given past history, and these 
oounterindications are reported and publicized in the profession. 

In the same way, certain forms of system alteration or modification aTe 
known to raise given problems with great regularity. When information 
systems are installed, or when decision-making processes are analyzed and 
changed, w e encounter the same problems over and over again. 

Unfortunately, in the latter case, published warnings and admonitions 


"f"„*** <rxtraslve discussiM of interrogation and training methods in tins form of 
intelligence operation, see O Fmfo, Snatcher, Harper, New \SJS2. 
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are uncommon. The investigator who analyzes and installs business systems, 
for example, is seldom forewarned against probable pitfalls, and he seldom 
has organized, published data to aid his implementation work. We may 
have diagnostic aids for hardware maintenance, and for trouble-shooting in 
many fields. However, there has been little emphasis to date on the diagnosis 
of conceptual or decision-making failures on the part of the analyst when 
he implements systems change. Where such data are available, the files are 
kept secret or transmitted from one worker to another as an art— for institu- 
tional reasons which should be obvious from our previous discussion in this 
chapter. 

For example, from the 35,000 or so electronic computer installations 
made in the United States in the past ten years much has been learned and 
published about electronic difficulties, hardware reliability, component 
failures, and design defects. Yet little, if anything, has been reported in an 
organized form about the many specific failures of the systems that have 
been installed— if those failures were due to faulty decision-making, inade- 
quate system conception, or lack of individual experience and foresight. 
Little, if anything, has been published about the specific difficulties those 
systems have encountered as times have changed. Not only is the nature of 
such data collection and organization difficult, but institutional barriers also 
prevent publicity. The physician does not testify against his colleagues 
except within the instrumental framework and the users and manufacturers 
of computing equipment are not likely to publicize their failures except 
when the onus can fall upon the hardware details, and frequently not then. 
The available data are locked in consultants' files, or in the notebooks of 
internal investigators. The auditor has his check list of devious practices to 
look for, but he is not likely to publish a statistical account of his, or his 
clients', difficulties and shortcomings. 

The point of these illustrations is that such compilation of problems, 
even if maintained privately by the investigator, can be a powerful tool in 
anticipating and preventing a repetition of implementation difficulties. The 
construction of such a listing is thus one of the major steps in analysis for 
implementation. It is a valuable step if the result is only a list. It is an even 
more valuable tool for the analyst, in a given case, if the broad listing can 
be classified and cross referenced in a hierarchy of problem types, with 
specific classifications for the anticipated problems of specific system types. 

In such a tabulation it would be seen that many of the specified problem 
areas arise when instrumental and institutional processes meet at an inter- 
face, when man-and-machine or man-and-man meet. 

A Problem Anticipation File 

To illustrate one such listing, we now present a selected group of imple- 
mentation problems that are general enough to provide both a manageable 



j 2 s Ma/y$/s for /mp/omenfat/on 


tet and a set ot categories for further development. For each of these 
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ment to the reader. The categories chosen are tahen from a composite ot 
systems analysis problems in several fields, and are presented in the orm 
0 { questions to stimulate further discussion and thought. 


1. Are the objectives and constraints perceived by the JnvwlJgalor the tame « those 
perceived by the organization? 


Frequently, differences in cultural background and the conflict between 
professional and administrative interests cause the systems analyst to solve 
the wrong problem. This most subtle of systems mistakes happens so fre- 
quently— even when the problem of analysis is stated in writing and agreed 
to by the analyst and the uscr-that great care must be taken to reach a 
common understanding of what is needed. 

The trouble is usually not so obvious as maximizing profit instead of 
minimizing cost, although that type of error is frequent enough. Usually* an 
objective is stated formally by the system user, but perhaps incorrectly or 
incompletely. So, the analyst may not at first perceive constraints that the 
user imposes upon acceptable solutions, or, conversely, he may anticipate 
constraints that are not in fact considered important. He may often be 
purposefully deluded. 


Example: A major oil company sought a scientific procedure for locat- 
ing filling stations as new stations were added to the distribution system. 
Initially, the objective was to locate stations that would return at least 
a minimum return on investment, as specified by the firm's management, 

A procedure that included many factors, among them estimated sales 
o\ petroleum products at the proposed stations, was developed. Checks 
of the new procedure indicated that it would locate stations more con- 
sistently and reliably than less organized methods. However, the new 
procedure was rejected by the executives charged with station location. 
Alter some time these executives revealed the method was not suitable 
to them because it included forecasts of station sales, and this limited 
executive flexibility in acquiring new locations. As it turned out the 
true objective was to add 200-300 new stations to the system each year. 
Many proposed station locations could not male the required return 
on investment, and competition for good locations was severe. Knowing 
tiiat the computed return on investment was based on internal transfer 
aU ° ind " dw! contributions lt > profit, the executives often 
inflated the estimate of station sales to get around what they felt was 
an unrealistic investment requirement and come up with the required 
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number of new additions (which otherwise would not have been possi- 
ble), Had the proposed method taken these facts into account, the 
systems analysis project might have been implemented, rather than 
rejected. It is not surprising in this instance that proposals for follow-up 
studies to compare predicted sales and investment return against actual 
values were also rejected. 

The analyst may avoid this pitfall or sidestep it in many cases by 
investigating in advance alterations in constraints and objectives that may 
be of possible interest, and by determining how the problem solution is 
affected by alternate problem statements. He may find that the problem 
statement contains many noncritical factors, learn which factors are critical, 
and be prepared for the presentation of alternate proposals should they 
be required. 

Because of his training and knowledge of methodology, the analyst may 
also tend to frame problem objectives and constraints to make his job easier, 
unwittingly leading also to the solution of the wrong problem. The work 
may be professionally competent and workmanlike in every respect, but 
it may also be irrelevant to the present need. The cliche that the right 
problem solved approximately is a better result than the wrong problem 
solved precisely is a fair warning. Most maj'or problems of systems imple- 
mentation are introduced at the beginning, when the wrong objectives and 
constraints are assumed in a system definition. 22 

There is another reason for looking carefully into the goals, objectives, 
and constraints to be used in a systems analysis project or design. The 
implications of the chosen goals, objectives, and constraints may not be 
clearly understood at the outset and a solution which seems desirable at 
first may in fact be fraught with difficulties that once commenced are 
difficult to overcome. 

For example, when automatic or automated systems are given a goal, 
it is, as the late Norbert Wiener observed, like invoking a form of magic. 
The goal is interpreted literally, and the full implications of a given instruc- 
tion or objective must be understood by the analyst if he is to stave off 
embarrassment and woe. 

Folklore, for example, contains stories-ranging from the Sorcerers Ap- 
prentice to the Monkey’s ?ato-in which magic, once begun, was difficult 
Or impossible to turn off. The Apprentice learned the words to bring magic 
to the alleviation of his workday tasks, but forgot how to stop the magic 
broom and the magic pail of water from overdoing their appointed tasks. 

•“‘At first it is impossible for the novice to cast aside the minor symptoms, which the 
patient emphasizes as his major ones, and to perceive clearly that one or two facts 
that have been belittled in the narration of the story of the illness are in reality the 
stalk about which everything else In the case must be made to cluster Herbert Amory 
Hare, Practical Diagnosis, Lea Brothers, Philadelphia, 1899. 
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(Fortunately, be was saved by the Masters return.) In the .Wontn/* Pa® 
and many similar tales, three wishes were panted. In each ease, the first 
wish J Med. but with unsuspected side effects And usually “ ' ' h “ 
stories the last two wishes were used to undo the honors created b> th 
fulfillment ol the first. 

Thus, at the outset, the analyst should beware lest lie have King Midas 
touch> 


2. Are the etlectWene** measures used In the analyiw appropriate? 

Since goals are formulated from sets of values or effectiveness measures, 
the measures used may be incorrectly cbosen and throw off the analysis. 

Example: A classic example, reported by Morse and Kimball-’, from 
their World War II experience, concerns the installation of antiaircraft 
guns on merchant vessels. On the one hand, guns installed on these ships 
were so “ineffective* as to be useless; on the other band, they made the 
cress's feel safer. Because the guns svere expensive and \s ere needed else- 
where, their removal was proposed. Indeed, data on equipped and 
nonequipped ships showed that only 42 of attacking planes svere shot 
down, a dismal figure that served to indicate the guns were not worth 
installing or keeping aboard. On second thought, boss ever, it was ap- 
parent that the percentage o! planes shot down Was not the correct 
effectiveness measure for the guns. Guns ssere installed on the ship to 
protect the ship, and the proper measure svas \sbcther the ship W3S 
damaged less with or without a gun. Analyses of the observed data in 
this light showed the guns definitely increased the ship’s chance of 
survival. Even though the antiaircraft guns did not often shoot down 
the attacking planes, a guns use lowered attack accuracy, reduced 
damage, and often saved the ship. The change of measure changed the 
derision, and the ships were equipped with guns. 

A typical error in selecting effectiveness measures is to scale alternatives 
by their ability to reach a given objective without regard to the resources 
consumed, or to seek a resource measure only without regard to effective- 
ness. The property of “cost-effectiveness" which gives the contribution to 
effectiveness per unit resource, is often used to remove this problem. Even 
then, it is essential that the “effectiveness" measure be the one desired.** 


’L s °* 0pt,d,im ' T « 5 “» v w *«< 

** cu «f d « Chapter 8. See also C. J. Hitch and ft. N. McKean, 
mo t H , aTv3,d tinivmny Tress, CambriS 

UfiWmity of California 
C Operations Research Society d America has a “Cost Effec- 
tiveness Set**, which «®ca*rat« on problems of measure selection and 
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3. Are Ihe attention and awareness functions of the system correctly oriented? 

Newly designed systems or organizations and those that have been in 
operation for some time can suffer from problems of incorrect awareness or 
goal rigidity, generally described as “fixation” problems. Goal-directed 
systems have some form of goal setting and holding function. And, because 
a system’s awareness of alternate possible courses of action, forms of 
organization, and other goal and value possibilities limits the types of goals 
and values that will be formulated, goal setting and holding functions of 
a system can be affected by these factors. 

Example: If the reader will tightly hold the thumb of his left hand 
in his closed left fist, and close his eyes, he will after a time “feel" that 
his thumb is larger than the other four fingers combined. This “percep- 
tion” of the size or importance of the thumb, were it constant, could 
easily alter the individual’s work habits, selection of desirable jobs, and 
even his sensory view of the world about him. The pictures drawn by 
children, which seem distorted to adults but realistic to children, are 
another example of how awareness and perception affect the organiza- 
tion of behavior. The senses most acutely tuned to a given scene, and 
the “mix’” of a system’s sensory input types can greatly affect behavior 
and goal-setting functions. An analysis of the nerve structure of the pig 
would cause an analyst to believe that the pig considers his snout a very 
important information source. The pig thus "views the world through 
his nose.” Several authors have discussed this problem of perception and 
awareness in different contexts . 25 

Difficulties arise if the data required to solve a given problem are not 
available, are not sought, or are not perceived “realistically,” i.e., with 
respect to the system's survival and growth. 

Similarly, the methods used for combining data, generating new goals 
and values, and updating presently used objectives may cause trouble. On 
the one hand, the goal setting process may be too rigid. Like the driver 
who locks his steering wheel and drives over the cliff, many systems can 
reach disaster if erroneous goal locking is present. On the other hand, lack 
of goal stability is also a problem. Then, the system acts like a small child 
who first picks up one toy, then another, in a succession of unrelated 
activities. 

“See E D. Adrian, The Basis of Sensation • The Action of the Sense Organs, Norton, 
New York 1928, and the same author’s The Physical Background of Perception, Claren- 
don Press’ Oxford, 1947. Also, D. O. llebb. The Organization of Behavior, Wiley, 
New York, 1949, and W. Penfield and T. Rasmussen, The Cerebral Cortex of Man, 
Macmillan’ New’ York, 1950. In a completely different vein, the works of Marshall 
McLuhan are based upon changes in the ratio of sensory inputs from different media, 
such as the printed page versus television, and their effects upon the individual and 
script v See The Gutenberg Galaxy, University of Toronto Press, Toronto, 1962 and 
Un&anding Media, McGraw-H ill. New York. 1964. 



434 Analysis for Implementation 


Example: The November 9, 1965 electrical blackout of the eastern 
seaboard v.as traced to a relay located at the Ontario Hydroelectric 
Commission's distribution plant at Queenston. Ontario, near Niagara 
Falls. According to the Federal Power Commission's report, the Ontario 
relay was set to operate a circuit brraler if the poner load exceed 373 
rndhon watts. It was set at this point in 1963 and was not reviewed. 

fntelT,' T* 8 * lMd on the lme co "' rolM b F telay 
n er ,' Va,tS ' ? nd « b " s »" -rftoty upward fluctuation 

p °' er tapped the relay and started the whole blachout” 

deteriorated from d«<; 1 , t * ata> component operation has 

time but without the analyst's'klim M SyS ’ Cm * lnlcl "' e bas changed with 
the System are not hep, current with pSem “ d Va, “ B ° f 
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These signals can often be captured and decoded by an industrial spy, 
usually with little chance of detection. For example, a teletype machine 
generates sparks as it prints. Even when located in a shielded room, 
its signals can be detected from several miles away, and reproduced with 
suitable equipment if security precautions are not taken. The problem 
of information protection is increased when information processing 
facilities are shared, as is the case in real-time computing centers. Plan- 
ning for the World Trade Center in New York, where computer systems 
on every ten floors will be shared, includes protection against monitoring 
systems. Every computer system for defense operations must be pro- 
tected against monitoring, according to the specification of FED-STD- 
222 and DCAENS 422-5s, the contents of which are known only to 
those with clearance. Less exotic forms of data falsification and theft (as 
well as material spoilage and theft) often present difficulties that are 
overlooked. For example, material theft, damage, hoarding, and similar 
activities can cause major data errors in inventory control systems. 

8. Are operators actually performing according to the system definition and plan? 

Many systems have not performed as predicted because an essential 
operation was not performed by an operator or group of operators accord- 
ing to the plan assumed by the systems analyst. 

Example: Electronic computers are often used to calculate minimum- 
flight-time and minimum-fuel flight plans for commercial aircraft. These 
plans take into account weather, wind, load, and similar factors, and 
are prepared for the pilot just before takeoff. When first introduced, the 
time and fuel reductions predicted by these schemes failed to material- 
ize. The question arose as to whether the optimizing computations were 
in error or whether pilot adherence to the computer plan was unsatis- 
factory. An investigation of the latter problem led to increased pilot 
cooperation, and the predicted savings were at last obtained. 

9. Wilt the system accept and act upon signs of impending disaster? 

Although the desire for survival and perpetuation is often strong in 
organizations and individuals, and is often designed into procedural and 
hardware systems, this survival or continuity of operation is often threatened 
when the system fails to act or delays action in response to clear signs 
of danger. 

There are cases in which the system is not “aware" of impending 
danger because such input data are not received or sought. But even when 
such data are in hand, many technical and institutional blocks can prevent 
correct evaluation and acceptance in time for adequate action. For example, 
the required pieces of intelligence information may be on the “desks” of 
several intelligence analysts who do not individually see the emerging 
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pattern of danger and dismiss the isolated pieces oi intelligence osttnta- 
portant The organizational leader may have surrounded h.mseli '«* weak 
yes-men vvho fail to tell him the truth. There may be techracal delays tn 
transmission, reception, and decoding and possible mistaVes along the y. 

But even when these faults are overcome, the executive, the oymlor, «t 
the control system often cannot, or will not, accept and act quicVly «P° 
the {act that disaster is at hand. . . 

The following is reported in some detail as a modem and classic ex- 
ample ol this often neglected implementation problem. 

Example: In the major eastern power failure of November 9, 1963. 
previously mentioned, the first signs of failure were at 5:16:11 P.M. when 
service was interrupted to only portions of upstate New "YotV and 
Ontario. 

There was a period of some 7 minutes to 12 minutes between the initial 
disturbance at 5: 16 P.M- when the service to the various portions of southern 
New York and New England finally collapsed. The operators at the various 
dispatching centers all knew after 5:16 on November 9 that the frequency on 
their system was going down and that the load had reversed and was placing 
a large drain on their generating reserves. . , , 

The night of the blackout a 62-year-old Con Edison Engineer, Edwin J. 
Nellis, who has been with the {New York City] utility for 41 years, was on 
duty at Con Edison’s automated Energy Control Center on Manhattan’s 
West Side. At 5:16 p-sr., the lights dimmed, and Mr, Nellis called for an 
instrument check, which showed nothing wrong with Con Edison’s generating 
equipment. 

Checking his instruments again, Mr. Nellis noticed a surge of power in 
from the north, then a surge outward. Con Edison officials report that he im- 
mediately called the Niagara Mohawk Company in Syracuse to determine 
what the trouble was. At the same time he was ordering all 12 Con Edison 
generating plants to peak power output— to handle any extra demand should 
the trouble be serious. 

Meanwhile, three other stations in the power network, or grid— a station 
in Flushing, Queens, one in Rockland County and one in Orange County- 
were reporting troubles of their own to Mr, Nellis. 

Vt was then that the lights dimmed a second time, and instruments showed 
a tremendous surge of power into New York, then out again. Mr. Nellis was 
now on the phone to Syracuse, which told him ’of trouble to the north.’ 

On beanng this, he told Syracuse, “I’m going to cut clear of you,” and 
began pusJnng the first of eight buttons to cut Con Edison away from the rest 
of the gnd. I e also began ordering Con Edison's network stations-^ in all- 
100 u "- “ d fa ** “ *•* «** — 

,, J" II* [federal Povier Commission! report said, "it warms 

£* <W4», ol the ted in !0n « secBtL^NC y„,k S 
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as well as facilitating restoration of service. But whether because of lack of 
clarity in the control room instrumentation or for other reasons, the system 
operator did not make an immediate clear-cut decision in this emergency .” 

In reply, Charles E. Eble, president of Con Edison, insisted that the com- 
pany S operators "followed established procedures and in our opinion made 
proper and timely decisions in the short space of time available to them.” 

However, the automatic controls on Con Edison’s generators, as 
T. H. White aptly observed for Life Magazine, were quicker “to protect 
their own,” They cut out by themselves when the survival of their gener- 
ators was threatened by the massive heat of extreme overload . 30 

10. Have potential difficulties at the boundaries of functions, departments, componenls 
or modules have adequately ‘‘bridged”? 

The analyst may in general expect difficulty in system implementation 
when his system crosses functions or boundaries. Although this statement 
is true of hardware systems— more interfaces or intercommunications be- 
tween components generally mean more potential trouble— it is particularly 
true for procedural and information processing systems that cut across 
organizational boundaries. Additional care in implementation is always 
needed to bridge these sources of potential system disruption. Severe 
measures or implementation decisions may be required if the “boundary 
problem” is aggravated by a man-machine interface. 

Example; To avoid such boundary problems at fbe outset, many 
systems analysts take the position that a new procedural or information 
system should be designed either to be completely manual and so 
simple that any operator can understand ft or, at the other extreme, so 
automated that no human intervention is required in the intermediate 
steps from input to output. Many failures with combinations of manual 
and automated steps are the basis for this conclusion. 

11, Has the analyst correctly evaluated his own resources In relation to the task of 
analysis and Implementation? 

Gross underevaluation of the time, cost, and personnel requirements 
for system development and installation is unfortunately the rule for projects 
specified by performance standards. Similarly, overoptimism is often the 
rule when a systems project is proposed with fixed resources. Usually too 

“From the Federal Power Commission Report, reprinted in the New York Times, 
December 7, 1965, p. 40, and the report by Times reporter Thomas O'Toole de- 
scribing the sequence of events in New York, p. 41. Copyright 1965 by The New 
York Times Company. Reprinted by permission- One may well ask what good is an 
automated information or control system if the operators who have the final word on 
system action fail to take action or veto the action clearly Indicated by the system 
ItseH? 
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much is attempted with too Me, with the frequent result that the project 
flounders, and this leads to general disappointment and losses. 

Enmtple: The installation oS even a modest data processing fact tty 
svJl tike about two years, on the average, although many firms, o them 
regret frequently make estimates of a year or less. The many steps re- 
quired in preparing for a changeover to the computer system invariably 
bring snags that prolong the eSort and raise its cost beyond what was 


expected. 

Similar problems occur when the analysts’ range of actions is smaller 
than is required to handle the system he confronts, and an assessment of 
this relationship— as discussed in Chapter 6— is in order, particularly when 
implementation, with its greater variety of difficulties, is confronted. 

Finally, systems implementation is subject to many threshold effects that 
raise questions of resource availability. Often a little implementation does no 
good, and a basic minimum of resources must be committed to assure any 
degree of success. If this minimum is not available, attempts at implementa- 
tion are usually wasted, an economic fact of life of which the analyst should 
be aware. 


12 . t* the chovm tot and Implementation big enough? 

The scope of a system considered may not be large enough to result 
in any major improvement, or worse, it may lead to unwanted suboptimiza- 
tion. In addition, the costs involved in system analyses and implementation 
are often relatively fixed after a certain threshold of resources is reached, 
and may not vary thereafter directly with the size of the system considered. 
Thus, major improvements or large-scale improvements may be no more 
costly than lesser efforts, although the payoff can be much greater. 

Example: The design of an inventory control system for a small com- 
pany with 1000 items requires almost as much effort as one for a 
larger company with 10,000 or 100,000 items, although the absolute dollar 
improvements in the latter case will usually be much greater. The 
hardware and paper conversion will be somewhat more costly in the 
latter case, but development problems, personnel problems, and the 
political problems of installation are essentially the same. This fact 
usually places the larger firm in a better position to use systems analysis 
than the smaller firm. 


In the same way, the analyst within an organization should look for 
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zation”) Although this may appear at first to make good political sense, 
the result is often not as expected. Both the analysts and the organization 
become discouraged with the lack of demonstrable and dramatic return 
from the costs incurred after a time. The bold choice carries the risk of 
dramatic failure as well as dramatic gain; but the piecemeal approach 
carries the risk that the results of the smaller efforts are not measurable 
above the noise inherent in a system’s reporting and evaluation procedures. 

Thus, there is a minimum size for systems analysis projects set, on the 
one hand, by the threshold cost of analysis and implementation and, on the 
other hand, by the need for measurable results, which often must be 
dramatic to be noticed. 


PROBLEMS 

13.1 Consider the conversion from manual elevators, operated by a staff of 
faithful and pleasant workers, to an automatic bank of elevators, which 
would save money in the long run. What are some of the institutional 
factors that must be faced in a luxury apartment house? In an office build- 
ing? In a small town? In a large city? By an old owner? By a new owner? 

13.2 According to Womens Wear Daily , and columnist Samuel Feinberg (De- 
cember 3, 1965, p. 6), there is a consulting firm in New York City by the 
name of Einstein Associates specializing in executive search and “counter- 
feit detection." 

In conducting its search for executives, the firm proceeds through two 
interviews. The first interview, which may take from half an hour to one 
and one-half hours, is conducted in a relaxed atmosphere at Einsteins 
headquarters in New York. 

"Interviewer and interviewee sit side by side, perhaps smoking, and 
drinking coffee. This part of the discussion touches on peripheral issues— 
earliest recollections of home life, schooling, athletic interests, dating, 
and so on, leading through business experience. 

'The stress interview takes place on another day, consumes from two to 
three hours, and really fives up to its name. The analyst and applicant 
are seated across a desk from each other. This phase of the interview is 
designed to keep the man off balance by peppering him with a rapid 
succession of questions that render him unable to program answers or 
adhere to fradulent information he may have given during the previous 
meeting. Bather being job-oriented and dealing with hard-core informa- 
tion, the stress interview draws out comments on both sides of a contro- 
versial subject. 

“For example, the man might be asked: 'What would you respond if 
your boss accused you of an act of dishonesty?' The man might be 
expected to react in one of two way's in his reply: He might act calmly 
and ask for proof of the charge, thereby indicating he is a secure indi- 
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vidual able to rationalize under pressure Or he might get. 
start a counterattack showing msecurrty. If he emerges in the Erst class, 
ficatta, he might then be asked to play the role of the insecure mma and 
attack the employer. If he stumbles in the changed role, l,e is prohMy 
o K. If he plays the both parts convincingly, he is suspect. He might 
be asked to play out five or 10 hypothetical cases. In addition, other 
questions are raised. Typical: What does he think of his wife? How does 
he get along with his children? If he fumbles in answers that should 
come effortlessly, he's practicing delaying maneuvers to give him tune 
to make up palatable, probably fanciful, replies. , 

“After several hours of stress interviewing, Mr. Einstein contends, its 
not possible to continue tbe fraud; ‘the counterfeit executive inevitably 
unveils himself.' 

"Finally, the applicant takes a test battery given by an independent 
psychological testing laboratory.' 4 

As an example of the results of such a procedure, the author of the 
referenced article cites a case from the consultants' files. 


“Three men were being considered by a department store for promo- 
tion from buyer to merchandise manager. One man was thought by man- 
agement to be particularly qualified because of his profit record. During 
the stress interview with this candidate— he handled contract orders for 
lamps for institutional use-it was discovered he had been faking back 
orders for two months so as to paint a false high-profit picture and be 
in line for promotion." 


A. Comment on this form of executive search in view of the stress methods 
of troubleshooting discussed in Chapter 10 and the conflict between the 
instrumental and institutional processes discussed in this chapter. 

B From a systems viewpoint, why is the problem of stress and the resolu- 
tion of institutional and instrumental process important as individuals move 
up the organizational ladder? 

C To illustrate the effect of institutional background upon instrumental 
proposals— what do you think of the procedures used by the cited con- 
sulting firm? Would you like to be subjected to the stress procedure 
described? If not, why? 


13 3 


A review of the work done in the Amalgamated Dating Service, Inc., 
(Project 6 Chapter 10) indicates that we have concentrated on the techni- 
cal task of matching codes obtained from a question schedule filled out by 
male, and emales. Quest, ons may „„w be Iaised about th> ^ , , he 

ononal data that went into (Ids system o[ computations. 

LTh'rt"* TLTf”* I ’ r ° ccd "" : w W devised change ii the ques- 
,™LeuLtme" ”' ,e pr ° Vi ' W tte code Wat 


B How could the computing method devised aid 
improvement of the question schedule? 


in 


an experimental 



Problems 441 


C. What type of skills and knoweldge would be required to design and 
improve the question schedule? 

D. Would a team effort be useful in this task? 

E. What are the instrumental and institutional factors that affect this 
application? 



THREE WAVS 

14 TO MPROVE SYSTEMS 


In this chapter we discuss three general routes to system improvement: 
Modular construction, improvements in information handling, and clarifica- 
tion of the goals and the constraint structure of the system. These methods 
of system improvement are typical of the design, use, and planning strategies 
currently used to improve system operation. The reader may wish to add his 
own strategies to this sample. 


MODULAR CONSTRUCTION 

Systems design and construction using a number of standardized and well- 
defined modules provides one way for the analyst to create and maintain 
systems of great variety with a limited number of distinct component types. 

By creating variety by combination, rather than by a proliferation of 
nonstandard components and subsystems for each new system, the analyst 
can limit the variety he must control, and at the same time increase the 
variety of his systems and their scope of control. In terms of Chapter 6, 
uhich discussed the Law of Requisite Variety, such a ratio of “analyst” to 
"system" variety is highly favorable, and results naturally in a host of 
economies and benefits in the production, test, operation, and maintenance 
of modular systems. 
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Benefits of Modular Design and Construction 

Here are ten reasons why modular design and construction should be con- 
sidered a route to systems improvement. 

1. The reliability of a standardized module produced in volume is 
greater than that of a custom-built unit. The volume justifies research 
costs, generates statistics on modes of module failure under different 
uses, and leads to continuing module refinement and improvement. 

2. The module can be produced at a smaller cost than a custom unit. 
It can be assembled and tested at a smaller cost because the required 
variety of equipment is smaller, because automated assembly and 
test equipment can be designed and used, and because assembly 
and test procedures can be standardized. 

3. The module package, particularly its interconnecting links, can be 
perfected for reliability and ease of attachment to other module 
packages. The physical connecting devices for components, or the 
documentation and ease of interconnecting information blocks can 
be improved. 

4. The designer working with proved components can generate and 
evaluate alternate designs quickly and less expensively than other- 
wise. Often blueprints and similar custom design details can be 
eliminated by references to modules by code name or number in 
highly simplified specifications or worksheets. 

5. Stocks of modules can be simplified by reducing the number of 
special modules held. Because the design and production modules 
can be made the same, system prototypes can be made from stock 
at small expense. 

6. Management functions, such as purchasing, production planning, 
price estimation, cost accounting, and most industrial engineering 
activities, are greatly simplified by the use of modules. Module costs 
and module production and operation times are known and may 
be assembled with ease to get total cost and time estimates of small 
variance for the design, construction, and use of proposed systems. 

7. Knowledge of module speculations and methods of operation may 
be transmitted with ease to operators and users of systems in many 
different locations. This means that many people who are familiar 
with the given modules can be made familiar with a variety of sys* 
terns with little difficulty. Training, documentation, and similar 
systems implementation tasks are thus eased. 

8. Standard variations in basic modules may also be made in a stan- 
dardized way, providing an additional source of variety. For 
example, standard clothing designs can be "sized graded” automa (i- 
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cally from the basic pattern, providing a variety of garments from 
the single original design. 1 

9, Modular construction eases maintenance problems by allowing 
easy probe insertion between standard modules, and by allowing easy 
module replacement, fault diagnosis, and treatment. Failure epi- 
demics can be prevented by compartmentalizing failure sources in 
their modules. Routine test procedures, simplified test equipment, 
and interchange of operators and repairmen become possible with 
modular construction * 


10. The Fact flat information or material modules can be handled as 
Hooks rather than as a multitude ol pieces can greatly speed 
natenal and information handling procedures, and thus cut their 
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The greater the variety of modules, the greater the variety of combina- 
tions that can be made with a given number selected from the list of avail- 
able modules. The reader will recall that the number of distinct messages 
than can be made with exactly r letters taken from an alphabet of N charac- 
ters is N r . The situation is often similar with physical modules. Additional 
variety in the character set, or list of module types, greatly increases the va- 
riety of available combinations. But this increase in system variety comes 
at the price of module proliferation. Thus, a balance must be struck between 
the amount of system variety required and the number of module distinc- 
tions necessary. The trick, of course, is to find the minimum number of 
module distinctions that will provide the maximum number of useful system 
distinctions, because many system combinations, which theoretically could 
be produced from a given set of module types, will not be feasible or de- 
sirable. The techniques of Chapters 11 and 12 are frequently useful in this 
evaluation. 

If, in addition to distinct module types, the modules can be intercon- 
nected in various ways, the system variety that can result from a selection of 
r modules from a list of N distinct module types is much greater than N r . 
The bricklayer can construct a variety of structures with his single module, 
the brick. Clearly, different module types that can be connected in various 
ways-like a tinkertoy-can generate many more. The possibility of multiple 
interconnection greatly complicates the job of estimating the variety of sys- 
tems that could be constructed from a given assortment of types, but it 
greatly extends the modules potential for generating variety.® Thus, such 
forms of module design are widely used, and alternate interconnection 
modes in effect increase the number of module types available to the user. 

The "size” of a module presents another possible trade-off. The larger 
the module, the fewer the modules in the system, so that large modules mean 
less potential system variety. Conversely, the larger module is often easier 
to handle, faster to combine, easier to find if faulty, and reduces the number 
of troublesome interconnection points. The advantage of increasing the 
relative size of system modules must therefore be weighed against the 
potential system variety which is forgone when using the big block. 

Modules may also be sized for physical convenience, to balance the cost 
of replacement and repair against the alternative of replacement and 
discard, to optimize diagnostic sequences, or on other grounds. The choice, 
in such cases, is dictated by the application. 

Our main point in this discussion, however, is that the resolution of 
module size, variety, and interconnection method is a basic system decision 

* Many firms disclaim responsibility for use of their modules when the variety of possible 
combinations precludes detailed evaluation of all possibilities. For example, the Digital 
Equipment Corporation of Maynard, Massachusetts, manufacturers of “Digital Flip- 
Chip Modules” used in computer construction states clearly in its catalogue that the 
company . . makes no representation that the Interconnection of its modular circuits 
in the manner described . . . will not infringe on existing or future patent rights.” 
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•when modular systems are planned. The initial selection commits the dc- 
signer s scope and method of operation by setting the specifications for his 
resources. 

For example, the standard module for exterior house construction is 48 
inches. The 4' X 8' plywood panel is a familiar result. To save material, con- 
tractors usually build houses to that module. On the other hand, interior 
finish work, fixtures, and furniture, such as cabinets, rugs, counters, and 
couches, are often constructed on a 36 inch module. The & X 12? rug is an 
example. Unfortunately, these different module sizes often conflict. The 
kitchen is a foot or so too big, or too small, for its cabinets. Rugs and furni- 
ture often fit awkwardly in rooms designed to save exterior cost. The resolu- 
tion of this problem is to consider the compromise that would reduce the 
total cost of both exterior and interior labor and supplies in the production 
of the furnished home. When this is done, the architect often finds it is 
better to waste some exterior material and save on interior material and 
labor, which is relatively more expensive. 
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physically withdraw the book, and then return it. To scan many references, 
you must repeat this frustrating and often fruitless process. The comparison 
and recombination of information is literally a physical chore. 

By contrast, consider the electronic databank or master file that is 
provided with remote input/output devices. A request for information may 
be made to such a system from a remote location. No physical transportation 
is required. The response time is measured in seconds. If the system has 
what you want, you may have your own copy at your own location at once, 
reproduced from a master copy, which is in mint condition. The master 
copy never leaves the file; therefore, it is never lost, stolen, or worn. You 
need not return your copy, and you can destroy it after use. Indeed, the 
display may be electronic, e.g., on a television tube, so that destruction is 
easy. If your original request was faulty and you do not get what you want, 
you can easily repeat the request. You can make many requests and compare 
results, without geographical movement of a single document or yourself. 
The process involves reproduced patterns of intelligence, not physical move- 
ment of material. 

This change in the logistics of information storage and retrieval pro- 
foundly affects the type of questions that wall be asked, the types of com- 
parisons that will be made, and the degree of control that can be applied to 
systems design and use. 

File Scope and Flexibility 

It is not surprising that the scope of electronic master files has drastically 
increased in the past few years. In 1900, massive files of income tax forms 
finger prints, auto registrations, insurance policies, mailing lists, or inventory 
records did not exist. Today, each of these applications calls for files of 
millions or tens of millions of records. The demand for volume storage and 
retrieval coupled with the ever decreasing cost, as well as the flexibility and 
potential for information transmission of electronic memory media has 
made possible monster electronic data files in almost every government 
and industry office. 

Although we have always had libraries and file cabinets, the change in 
scope has another dimension. If data stored in fast access form are organized 
according to the modular principles of the previous section, data elements 
can be recombined and restructured at will. With this modular format and 
electronic storage media, Ble scope becomes an asset, rather than a burden, 
because the fast generation and analysis of a variety of data combinations 
becomes a feasible and productive act. 

The following example, taken from a current application, illustrates 
what can be done with relatively simple equipment if the data are properly 
organized. 
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Example: Alexandria, Virginia, a city of 120,000 persons, recently 
placed records pertaining to taxes, street lighting, road repair, crime, 
city parks, welfare cases, housing code violations, health hazards, fires, 
mortalities, census figures, and other data in a consolidated file, or data- 
bank keyed to street address and to land parcel. 

The city s two master files consist of three reels of magnetic tape— the 
equivalent of over 250,000 punched cards. The first file is by street section, 
and contains 120 items of information about the 3518 blocks and inter- 
sections in the city; the second, or Land Parcel File, contains 91 types of 
information about 20,000 parcels of land. As many as 15 different requests 
for information can be processed against these files at the same time. 
Thus although the main purpose of the master files is for tax billing, 
' ene ' V3 ' !hldies ’ and maintenance 
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The City Manager wanted to evaluate the city’s street lighting program by 
determining what relationship existed between the types of street lights on 
each of the city’s 3518 blocks and intersections, and the number and types of 
crimes and accidents. The Traffic Director asked for a survey of the 1200 inter- 
sections in the city showing those where five or more accidents had occurred in 
the past three months, giving the type of signalization, classification of street, 
type of pavement, and width of intersection. The Director of Public Works 
wanted a report of the condition of all sidewalks in the city. The Police Chief 
requested a summary of all crimes and arrests for each neighborhood. The 
Urban Renewal Administrator wanted a survey made of two proposed urban 
renewal areas to analyze the density and location of welfare cases, minimum 
housing code violations, health hazards, fires, mortalities, crimes, and arrests. 
The Parks Superintendent called for an alphabetical list of the 2318 blocks in the 
city, listing the length of block, width of right of way, and type of sidewalk — 
he planned to use the list to schedule and to record the work of his tree trim- 
ming crews.' 

In most cities the time and cost required for such planning reports would 
be prohibitive. But with Alexandria’s databank the requests were filled by 
Friday of the same week. The cost involved three hours of staff time and 
$67.50 for computer rental at the service bureau. 

Review of this example will give the reader an insight into the planning 
and control improvements that can be made in management information 
systems at relatively little cost when careful attention is given to the ex- 
ploitation of data scope, modular data organization, and flexible storage 
media. 

It is also important that analytical methods in statistics, engineering, 
logic, mathematics, operations research, and management science will be 
given greater power by such stored data— which are the inputs for these 
methods and procedures. 

Skill Files 

Another generally unappreciated change in information handling capability 
comes from the exploitation of skill files. 

Although libraries are full of “how to” books and references, the indi- 
vidual must refer to, learn, and apply what is there by himself. Often the 
process is imperfectly, or erratically carried out. It is also time-consuming 
and costly. 

But, computers and devices like numerically controlled machine tools 
can execute procedures in great detail and precision from prerecorded and 
stored routines without human intervention. The revolution in procedural 
capability of modem systems is not difficult to understand when the impact 
of this fact is fully appreciated. 


f John K. Parker, op. cit. 
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The difference in old and new capability may be illustrated by the 
student of statistics who is ashed to compute the standard deviation of a 
large set of data. Using the old approach, he would refer to a book, learn 
(ire method of computation, manipulate the data, and come up with an 
answer-, process that must he repeated lor each new problem. Vising the 
stored skrll approach, the student would call for the stored machine instruc- 
hom for computes the standard deviation, enter the required data (or 
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can use his human ability to recognize unusual or novel results. He will be 
aided in implementing such innovations by the systems ability to perform 
in short lead times, the delays of training and learning are eliminated or 
drastically reduced ivhen machines direct other machines. Machines that 
are so directed accept novelty with ease.® They are not captives of institu- 
tional constraints. 

Finally, the quality of plans the analyst can produce will also improve 
under these new circumstances. As the analyst sees more alternatives and 
asks for new combinations, there is a high probability that one or more of 
them will be better than those culled from a smaller set in the past. 

Indeed, the entire method of training, thought, and activity of the 
systems analyst will be affected in the future by these new capabilities, as 
will his systems designs, diagnoses and treatments. 


GOAL AND CONSTRAINT REFINEMENT 

As a consequence of improvements in modular construction and data han- 
dling, the analyst has available a third general route to systems improve- 
ment— the refinement of the goal and constraint structure of his system. 

With stored data and skills available, the analyst can more easily investi- 
gate the effects of alternate system goals and alternate system constraints 
that set the strategy of long-range plans. And because such investigation 
will be increasingly easier from a processing viewpoint, the analyst will 
investigate. 

The Trend In Long-Range Planning 

The trend toward greater emphasis on strategic planning is already apparent. 
For example, in 1961 in a survey conducted by the Stanford Research In- 
stitute, 60% of the nation's 500 largest corporations reported some form of 
long-range planning, and 20 % of a sample of 3600 smaller firms were also 
engaged in such efforts. 10 These results were higher than those previously 
observed, and the upward trend continues. 

As the cost of electronic files and real-time communication decrease, and 
as the scope of databanks and the range of skill files increase, casual atten- 
tion to long-range planning will become obsolete. There will be closer 
integration of long-range strategic planning and tactical operations. 

* For example, the creation of new die sets for the production of new model automobiles 
currently follows a sequence of analyst-computer design, machine refinement of pro- 
cedural details, and production on numerically controlled machine tools. Production 
changeover times are correspondingly reduced, and model change is easier. 

"J, K. Allen, "The Rising Acceptance of Corporate Stratrgy," }. of the Stanford Re- 
search Institute, January, 1965. 
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The difference in old and new capability may be illustrated by the 
student of statistics who is ashed to compute the standard deviation of a 
laige set of data. Using the old approach, he would refer to a booh, learn 
the method of computation, manipulate the data, and come up with an 
aimver-a process that must be repeated for each new problem. Using the 
stored skill appmach, the student would call for the stored machine instruc- 
tions for computing the standard deviation, enter the required data (or 
retrieve them from his prestored master file), and request execution. The 
machine would perform the computation under its ou.n direction, do so 
within a predictable time and at a predictable cost, and with a precision and 
speed that is unlikely under manual direction. 


Moreover, the skill once committed to procedure and stored need not be 
Teamed’* again by the operator or user. Such skills can grow in scope as 
new fobs are worked out and stored. The stored skills can be transmitted 
easily from one machine location to another, then executed instantly. Thus, 
a number of workers in remote locations can share their store of skills, a 
fact that greatly ’multiplies the productive output of each. In addition, the 
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can use his human ability to recognize unusual or novel results. He will be 
aided in implementing such innovations by the systems ability to perform 
in short lead times, the delays of training and learning are eliminated or 
drastically reduced when machines direct other machines. Machines that 
are so directed accept novelty with ease.® They are not captives of institu- 
tional constraints. 

Finally, the quality of plans the analyst can produce will also improve 
under these new circumstances. As the analyst sees more alternatives and 
asks for new combinations, there is a high probability that one or more of 
them will be better than those culled from a smaller set in the past. 

Indeed, the entire method of training, thought, and activity of the 
systems analyst will be affected in the future by these new capabilities, as 
will his systems designs, diagnoses and treatments. 


GOAL AND CONSTRAINT REFINEMENT 

As a consequence of improvements in modular construction and data han- 
dling, the analyst has available a third general route to systems improve- 
ment-the refinement of the goal and constraint structure of his system. 

With stored data and skills available, the analyst can more easily investi- 
gate the effects of alternate system goals and alternate system constraints 
that set the strategy of long-range plans. And because such investigation 
will be increasingly easier from a processing viewpoint, the analyst will 
investigate. 

The Trend in Long-Range Planning 

The trend toward greater emphasis on strategic planning is already apparent. 
For example, in 1961 in a survey conducted by the Stanford Research In- 
stitute, 60 % of the nation’s 500 largest corporations reported some form of 
long-range planning, and 20 % of a sample of 3600 smaller firms were also 
engaged in such efforts. 10 These results were higher than those previously 
observed, and the upward trend continues. 

As the cost of electronic files and real-time communication decrease, and 
as the scope of databanks and the range of skill files increase, casual atten- 
tion to long-range planning will become obsolete. There will be closer 
integration of long-range strategic planning and tactical operations. 

* For example, the creation of new die sets for the production of new model automobiles 
currently follows a sequence of analyst-computer design, machine refinement of pro* 
cedural details, and production on numerically controlled machine tools Production 
changeover times are correspondingly reduced, and model change is easier. 

”J. K. Allen, “The Rising Acceptance of Corporate Strategy," J. of the Stanford fa' 
search Institute, January, 1965- 
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As one example, (he integration of operating and capital budgets will 
become commonplace, although in many organizations this tie-in Is often 
not current practice. It will be possible to relate long-range capita] expendi- 
tures to long-range personnel requirements. Long-range forecasts of product 
line profitability, diversification, and divestment will be related to facilities 
planmng and technological trends. The sensitivity of system performance to 
present and projected constraints-imposed by capital, resource, and sldll 
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potential for realistic and swift goal adaptation and long-range planning 
is at hand. 

Thus, the third route to systems improvement requires the analyst to 
use the currently available technological trends to improve system adapta- 
bility and planning. 


A SUMMARY STATEMENT 

In the past thirteen chapters of this book and in our present discussion of 
system improvements, we have taken a sample of current systems thought 
and presented it in a sequence moving from systems definition, to systems 
analysis, to systems implementation and treatment. Although other examples 
and other techniques could have been used, the outline of our presentation 
provides the reader with a framework for his own future study of the 
systems field. The future expansion of “systems thinking” argues for a 
continuation of the efforts the reader has already made. 

If you have come this far you now have a basic vocabulary, a set of con- 
cepts, and a selected set of techniques from which to proceed. The daily 
newspapers, the popular press, and the scientific literature will continue to 
provide an ever increasing supply of specific examples, cases, and tech- 
niques, which may be selected to be congenial to your own interest and 
specific needs. A clipping file and notebook may be suggested again as an aid 
in the reader’s updating. 

In addition to this background information, we intended to give the 
reader some lasting fascination for at least one example of systems applica- 
tion, methodology, or concept. In this chapter we hoped to reinforce that 
interest by citing three powerful trends in current technology and systems 
thinking as they relate to systems improvement and to the analyst’s future 
work. If this attempt was successful, the author is well pleased. 

If the reader’s interest has been caught, it remains only to remind him 
that he is now embarked on a stimulating but sometimes hazardous journey 
into the future— best expressed by an old Chinese proverb: 

"Riding a tiger, it is difficult to dismount.” 


PROBLEMS 

14.1 In modem computer systems designed for versatility of use, a basic 
module of data representation is often chosen within which a variety of 
data configurations can be made. Data storage capacity is then rated by 
the basic module, often called a “byte” or a "slab.” 

For example, consider a "byte” which consists of a block of 12 binary 
bits. This block of 12 bits can be used to represent (1) two 6-bit binary 
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coded ***** ctarsckers, (21 *« «*» »r* I"*"*' P) ,0U ' 
Sbit octoV • cWt«s, or (4) 12 [>«■» ctoaclers. 

A. \Vhy is *e choice of b>1e sise (12 biw W«> ■» taportort ^ 
decision in the information handling system. 

B. A colter tnestsoty is fcomvn to hove a capacity * 

defined above. Bow many alphabete Aaractm could lias mmor/ «*■ 
tain? How many decimal characters? How many octal charade . 
many binary characters? 

C Why would it be important for a computer to know that incoming in- 
formation is all numeric, or all alphabetic? Why would it be desirable to 
use different modes of coding for these two forms of information? How 
would module use change under these two different input conditions* 
What dispensation must he made of the module or byte configuration It 
the computer does not know whether incoming information is numeric or 
alphabetic? 


14.2 An interesting derice to improve the performance of a system is the 
inclusion of a “look-ahead" feature in its design. For example, experienced 
copy typists read several words ahead of their actual typed words. Ex- 
perienced radiotelegraph operators lag their transcription behind what 
they hear. Modem real-time computers also make use of a similar feature 
and “look ahead" at work to be done, so that data and subroutines needed 
for shortly following jobs will be on hand when needed. 

A. Consider this feature and the balance between the amount of lag 
(required for look ahead) and the variety of response that can be gained, 
by using this delay to nuke corrections and reallocations of resources versus 
the possible danger or loss that could occur from delay. 

Q. In what way could the look-ahead feature be used without danger, or 
with mitigated danger, in information handling systems? 

C. Comment on the following quotation from S. W. Ransom and S. L. 
Clark, The Anatomy of the Nervous System, Philadelphia; Saunders, 1959 
(Tenth Edition), pp. 381-382. How does this description of the human 
nervous system relate to the operation of real-time computing systems? In 
what ways could advance instruction or advice aid in the response prepara- 
tion described below? 

“The more elaborate the development of the nervous system, the 
greater its capacity to react with discrimination to stimuli. To put it in 
other terms-. The more intercalary neurons in the nervous system the more 
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possible internuncial paths are there between the Incoming sensory stimulus 
and the outgoing motor ones and the less sterotyped the response may 
be. Man with his most highly developed nervous system has the greatest 
opportunity for variation in response to stimuli. With more internuncial 
neurons between the receptor and effector, there is a delay in response to 
a stimulus, and so it is possible to state that the cerebral cortex must affect 
action in at least three ways: by delaying the response to stimulus, allow- 
ing for a choice in the response, and aiding in integrating the action 
involved. The time of conduction being relatively brief, the advantages 
in choice and integration of the response more than compensate for the 
implied danger of the delay.” 


14.3 Why does the ability to forecast results in detail coupled with the ability 
to compare the actual results with the forecasts lead to more realistic 
planning? 


PROJECT 10 

According to Business Week, May 14, 1966, pages 164-166, (“Information be- 
comes a hot item”), businessmen see in the growth of databank services a 
"fantastic market." 'lire magazine explains: "OJdJine information suppliers are 
being joined by a diversity of new sources, including computer makers, economists, 
consultants in engineering and programming, market research outfits, Western 
Union and other communications companies And there's a flurry of mergers, 
acquisitions and joint operations as the computer specialists try to tie in with 
existing databanks and information services." 

For example, the article cited reports a number of databank services as of 
the article’s date: 

1. International Telephone & Telegraph Corp. has a databank for economic 
forecasting and one on world trade statistics for the United Nations. 

2. Bunker-Ramo Corp. has a financial data service with on-line stock prices. 

3. McGraw-Hill is selling new computer-produced construction market data 
through its F. W. Dodge Co. division the old-tine construction data supplier. 

4. IBM has formed a new Information Marketing Group to sell data obtained 
by combining Dun & Bradstreet’s file of corporate information on 390,000 U. S. 
businesses (95£ of all manufacturing) and the Commerce Departments input- 
output" data on interindustry transactions. These combined files will be used 
by the computer to provide customers with market potential estimates, lists of 
customers, estimates of market share and the like. 

5. Some 2000 economic series will be stored by a private service in a com- 
puter at Carnegie Institute of Technology in Pittsburgh, on tap by Teletype for 
economic analysis and forecasting. 

6. Compustat, a service of Standard & Poor’s, subsidiary of McGraw-Hill, 
provides complete computer tape records on 1000 companies’ stock prices, sales, 
earnings, dividends, capital investments— a total of 60 items. 
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7. Legal Research Service, Inc., and Western Union Telegraph Co. in a joint 
venture provide legal precedents direct by Telex printer to law offices, based on 
a ten-digit inquiry code and a master file of over 1 -million coded cases. 

Many other databanks— from medicine to government— in combination with 
real-time communications and data processing facilities are currently in the process 
of revohitioriing the information-handling and data-analysis capability in many 
industries and administrative departments. 

A. Make a study of the databank and information-handling capability of an in- 
dustry of interest to you and report on the effects the current trends in technology 
will have upon your specific area of interest. 

B. What effects will these facts have on your own chosen field of interest-in 
terms of your own employment and promotion possibilities? 



APPENDIX A 


MATRIX OPERATIONS 


A rmtrix is a tabular array of data, organized in a standard form, which 
™ay be handled as a package in mathematical calculations. A matrix's size 
is called its order, and is denoted by its rows m and its columns n. Thus, a 
matrix with three rows and four columns is a 3 X 4 matrix, or a matrix 
of order 3 X 4. 

In standard notation a matrix’s rows are numbered i ~ 1, 2, 3, . . . , m, 

and its columns j = 1, 2, 3 n, so that a given cell in the matrix, found at 

the intersection of row i and column is denoted by the subscript ( ij). Thus, 
a number in row 2, column 3 is designated Figure A-l shows the stan- 
dard nomenclature. 

Thfc rra'sm typtraiiuro sA addition an4 wb ftTOtfton, wViYth dftfrrrt.4 
only for matrices of the same order, follow the usual rules of algebra. The 
sum of two matrices of the same order is a matrix of identical order which 
has as its entries the sum of the corresponding cells of the first two matrices. 
Thus, if c,f is a cell in the matrix sum, and the corresponding cells in the 
matrices added are a,, and b,j, respectively, then matrix addition can be 
defined as performing the operation. 

On — on + bn 

for all m X n combinations of f and j. As in usual algebra, for the result to 
make sense, the row and column definitions of the matrices added must 
be identical. 
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Matrix subtraction is thus equivalent to algebraic addition with the sign 
of the subtrahend changed. Thus, if [A] is stock on hand and we sell [B], 
then we have [C] left, as below. 



s 

M 

L 


S 

M 

L 


s 

M 

L 

R 

2 

3 

5 

R 

2 

1 

2 

R 

0 

2 

T 

B 

1 

6 

2 

- B 

0 

2 

2 

= B 

i 

4 

0 

W 

7 

6 

E 

W 


IT 

1 

W 
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4 
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Note that in matrix addition and subtraction, the units, i.e., the row and 
column designations, remain fixed throughout the operation. No transforma- 
tion of units takes place. The rules for addition and subtraction are the same 
for any order matrix; we repeat the required operation for each correspond- 
uig cell ({/) until all m X n combinations have been completed. 

Matrix multiplication requires special treatment. For simplicity, it can 
be thought of as a form of accumulate-multiply operation. To introduce this 
topic, we will first consider matrices that have only one row or one column. 

A matrix with a single row is called a row vector; a matrix with a single 
column is called a column vector .* Suppose we consider a row vector with 
three column positions, i ~ 1 , 2, 3, and a column vector with three row 
positions, ; = 1, 2, 3. Then, we may define vector multiplication as the 
accumulative multiplication operation 


fli b t + a.b 2 + a s b 3 = c 


in which the final answer is a single number, Note that in this operation 
the units of vector [a] and vector [b] are transformed, and that for a correct 
answer, the column units (or dimensions) of the row vector must be the 
same as the row units of the column vector. 

Example: In the warehouse we have small, medium, and large sweaters 
priced according to size. Suppose we have a row vector giving the total 
number of units in inventory by size, and a column vector giving the 
unit price by size. The vector multiplication operation gives the total 
value of inventory in the warehouse— in dollars, not pieces. In general, 
the resulting units will be those of the row vector row and the column 
vector column. For this example the size of the garment is lost. 


A row or column vector is usually denoted in lower case by one of the equivalent 
forms: [a], a, or simply a, when the meaning is clear. For e3se in typesetting, paren- 
theses are often used to replace the boxes shown above. 
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S Fa. Unit 


Number 
o! Units 
instock 


a & 


x 


Ml 3 



Input vector to] Transform 
vector lb] 


r~n Number of 
1 105 1 $ in stock 


Output 
vector 1c] 


To be certain we understand how the row and column units combine, 
we can vmte out the calculation in detail: 

(10small)($2/stnall) + (2 medium) ($3/medium) 4* 

(4 large )( $4 /large) = $105 

Note that the row vector column dimensions and the column vector tow 
dimensions always cancel out in the multiplication of the individual terms, 
leaving a uniform dimension for each term. Only when this is true can we 
sensibly perform the addition operation. 

To reinforce the idea of a transformation, it is helpful to think of a 
block diagram, as shown in Figure A -2. Here three inputs (in pieces) come 
into a box, which converts them to a single output (in dollars). The inputs 
come from the row vector; the required transform comes from the column 
vector; and the transformation comes from the multiplication of the row by 
the column vector. 

Having mastered this idea, we may now apply it to an entire matrix, 
although some care is required. Two matrices (A] and (B] may be multiplied 




" S'itsiis VTlT' m ot ' era ' i0 "- Tte ”*<*«»>» 01 «. 

"«“«» «» Urtow. i» L ' h “ '” ac, ‘' b< “ ,ra ™ , ° ,m 
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only if the number of columns of [A] is equal to the number of rows of [B]. 
The resulting product [C] will be a matrix with the same number of rows 
as [A] and the same number of columns as [B]. A helpful formula to re- 
member is 


[A] X [ B ] = [C] 

" »Xp 9X» mXn 

Thus, if [A] is an m X p matrix and [B] is a q X n matrix, then [C] is an 
m X n matrix. The operations we perform are identical to vector multiplica- 
tion, except we consider each possible row and column vector multiplication. 

Example: Consider the inventory given by matrix [A] of the previous 
addition example, by size and color. Suppose we make a different dollar 
evaluation for each size sweater, but make no distinction by color. What 
is the total value of merchandise in the warehouse if we wish to retain 
the color classification? The result is shown below, the multiplication of 
each inventory matrix row times the price column vector. 


S M L 8 8 



Thus, to obtain $33, we perform the arthmetical computation: (2X2) 
+ (3 X 3) + (5 X 4) = 33; to obtain $28: (1 X 2) + (6 X 3) + 
(2X4) = 28 ; and to obtain $44: (7 X 2) + (6 X 3) + (3 X 4) = 44. 


In general, when we multiply two matrices of appropriate order accord- 
ing to this procedure, each possible row-by-column combination gives a 
number, which we enter in the answer matrix cell corresponding to the 
intersection of the chosen row and column. Thus, multiplication of row ( 1 ) 
of the first matrix by column (1) of the second matrix would produce a 
result for cell (1,1) of the answer, the process being repeated until all 
cells of the answer matrix are filled. 


Example: The easiest way to understand the operation is to see a few 
correct numerical examples: 


14 

20 

30 

44 


(b) 
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It would be impossible to matrix-multiply in the following two cases, 
because the row and column requirement is not met. 
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At first glance, we might be tempted to use direct matrix multiplication 
of the piece inventory [A] by the value distribution [B], because the row 
and column requirement is met and we could come up with some numbers. 
But there is no way to match the units of size and color to get a sensible 
answer by matrix multiplication as it is defined. 
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The haberdasher would have no difficulty in coining up with the result 
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28 
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Value of inventory 
by size and color 


and he would obtain this answer by multiplying corresponding cells of [A] 
and [B] to get the cells of [C], i.e., he would find = a i} b if (whether he 
blew he was performing this fancy operation or not). But, this result and 
operation are not matrix multiplication , by definition. 

The reader may have concluded by now that matrix multiplication is 
useful when a transformation of units* is needed, which is not the case in 
the last example. Matrix multiplication, like vector multiplication, is a 
transformation operation, a point we shall return to in a moment. 

Before continuing, the reader should note that the sequence in which 
matrices are multiplied is critical, because, in general, AB ^ BA, First, con- 
sider two matrices of the same order, which may be multiplied using the 
rules above. By direct application of these rules, we see, for example, that 


0 

1 

6 

8 


* A matrix, such as a transition matrix, may indeed have the same row and column 
designations, say states (see Chapter 2). But although the state names are not changed 
by matrix multiplication, the states occupied, or the state probabilities are. 
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Moreover, if two matrices are not of the same order, matrix multiplication 
may not be defined when the seqcenoe of multiplication is reversed; a 2 X 3 
matrix may be multiplied by a 3 X 4 matrix, but a 3 X 4 matrix cannot 
be multiplied by a 2 X 3 matrix. Finally, a check of the definitions of the 
rows and columns of the two matrices to be multiplied will usually indicate 
that a reverse of the multiplication sequence, even if possible, will not make 
dimensional sense, nor be numerically equal to the original sequence of 
operations. The critical nature of the sequence of matrix multiplication will 
be mentioned again as we discuss matrix inversion , a topic we introduce 
via the following discussion of simultaneous linear equations. 

A major application of matrix methods is in the solution of simultaneous 
linear equations. For example, suppose we have two simultaneous linear 
equations with two unknowns 


8x. + 4x z = 400 
5ii + lOij = 400 


which have the general format 


+ a«x, = Ci 
0*1*1 + OttX t = C t 

Before discussing a matrix solution for these equations we will benefit 
from doing a little algebra in which we study a standard arithmetic method 
Urat will progressively eliminate variables from any set of simultaneous 
linear equations, i.e., a canonical procedure for getting the job done. We 
use the numerical example above for clarity and indicate the solution 
steps required. 

Step I: Divide the first equation by 8, the coefficient a„, to give 

*. + Jtx* = 50 
5xi + 10x, = 400 

(2 Meav?,i e SUhtIaCt 5 ' ‘ he CW:fficie '" Ration ( 1) iron, Equation 


i. + Kx, = 50 
0 + 7Sz, ~ 150 

the Cyde ’ *** EqU3H °" < 2 > * 7* the 


*. + ix, = 50 

0 + *i =20 
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Step 4: Subtract 0.5, the coefficient a,,, times Equation (2) from Equa- 
tion (1), leaving 

x t + 0 = 40 
0 + x 2 = 20 

Thus, the solution is x 2 = 40, Xs — 20. 

Note that we have progressively eliminated variables from left to right 
using repetitive division, multiplication, and subtraction. The method may 
be extended to n equations in n unknowns, although the amount of work 
goes up rapidly with the size of the array. 

However, in general, using the above process we always obtain (1) a 
unique solution, as above, (2) no solution because the equations are incon- 
sistent, e.g., Xj + x 2 = 1, and x 2 + x 2 = 2, or (3) an infinite number of 
possible solutions because the equations are not independent, e.g., x, -f x* = 

1, and 2Xi + 2x, = 2. The canonical procedure will fortunately point up 
the latter two special cases by showing a contradiction, to indicate incon- 
sistency (e.g., 0 = 2), or by dropping two variables (or more) in a sub- 
traction, to indicate there are two or more dependent equations somewhere 
in the set. 

Clearly we could write the two equations fust solved in matrix and 
vector form* 


rs 

[_5 10J LxJ [_400j 

where the first matrix, say [A], is the matrix of coefficients of the two equa- 
tions in which the position in the matrix corresponds to the position of the 
coefficients in the equation array; and where the column vectors, [x] and [c], 
represent the unknowns and the constants to the right of the equality, 
respectively. 

Thus, the set of two equations in two unknowns may be written in 
matrix and vector notation simply as® 

Ax — c 


a general notation that will do for any system of n linear equations in n 
unknowns. 


4 For simplicity, hereafter we use brackets (instead of boxes) to indicate a matrix or 
a vector Two adjacent brackets indicate multiplication. 

'Also for simplicity, we will understand that a variable without a subscript is a matrix 
or a vector. 
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1! the reader does not see this equivalence, he should carry out 
matrix rrmWjUcaKntt required, noting that too rmtnccs (or vectors) arc 
equal if and only if each of their corresponding cells is equal. 


'8 qprq [ 81 , + 4r,T ROOT 

.5 lOj U«J " LAri + lOrJ L400j 


We may now introduce the following concepts, which follow naturally 
from ordinary algebra. 


The equality of a matrix equation is not altered if: 

(1) The elements in a row of a matrix equation are multiplied (or 
divided) by a constant (since this amounts to the multiplication or 
division of hath, sides of the equation by the same number b or if 

(2) The elements in a row of the matrix equation are added or sub- 
tracted from another ro w (since this amounts to the addition or 
subtraction of equal amounts from both sides of the equality). 


Using these ideas and what we have just learned about the canonical 
procedure for variable elimination, let us solve the same equations by the 
much neater detached coefficient tableau method. 

We first place the matrix A and the constant column c in a tableau, as 
below, (The vertical line represents an “equals” sign with the coefficients 
on the left and the constant column on the right.) 


/8 4 I 400\ 

\5 10 | 400/ 

We now apply the canonical rules of row division, multiplication, and 
subtraction as before— but we now apply these rules to each tableau row as 
if it were an equation— to get 



after Steps 1 and 2, and 

after atep3 3 and 4 of the 
canonical procedure. 


The answer, x, = 40, x, = 20, may, of course, be read off by reference 
lo the position cl the values remaining in the last tableau. For example, we 
Have a 1 , n the x, column and a 40 in the constant column which may be 
, J “ “■ -f ’ or sta P l >’ » = The final solution should always show 
ttlh m f "i e “”^ ning l! d°W" diagonal of (he coefficient matrix, 
with ffs elsewhere. (It i, possible that some of the entries in the constant 
column m'l be negative or xeto, but if we can develop the diagonal by 

toaTpZe^ Can0 "'' Cal f ’ t0C "’“ re ' ™ are aSSUred 01 * station 
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The beauty of the tableau procedure is that we could easily have solved 
for two (or more) sets of conditions, i.e., constants, while we were doing 
the rest of the work. For example, suppose we started out with 

/8 4 | 400 600\ 

\5 10 I 400 300/ 

where we have two columns of constants on the right, representing two 
cases, and our equality considerations apply to the 400,400 column as Case X 
and to the 600,300 column as Case 2, although our procedure of solution does 
not change from the one-case computation. 

Following the standard rules again, but applying them across the entire 
row, including both cases, let us see what happens: 

/8 4 ( 400 600\ 

Vs 10 ) 400 300/ 8 a 

(o 7.5 I 1M -7s) 

(o 1 [ to -fo) * fte ”* e P s3 “’ d4 ' 

The answer for Case X is, as before, Xi =40, x 2 =20, found in the first 
column of constants. The answer for Case 2, x, — 80, x, = — 10, appears in 
the second column of constants. Note that the detailed computations on the 
left-hand side of the tableau are in no way affected by the introduction of 
the additional case. Note also that this procedure may be extended to any 
number of cases by adding more columns to the right-hand side of the 
tableau. 

For reasons that will shortly be dear, let us solve another example using 
the same two equations, but under two different constant conditions, namely. 


(! 


1 o\ 

I o 1/ 

A 0.5 | 0-125 0\ 
\0 7.5 1 -0-625 1/ 


A 0 
\0 1 


0.1G7 -0.067N 
-0.083 0.133/ 


start 


after steps 1 and 2. 


after steps 3 and 4. 


In the final table we have rounded-off to three figures, but, to that 
accuracy, the solutions may be read as before from the final tableau by 
remembering that the “1" in row 1 represents i, and the “l” in row* 2 repre- 
sents 2. 
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Again for reasons of later need, sve should note the interesting outcome 
wherfwe'multiply the original matrix of coefficients A by the matrix fonn 
from the two solution columns in the final tableau, considering these column 
"l a 2 X S matrix B. If matrix A is multiplied by majnx t B we ,£ 
another 2X2 matrix I, which has l's along its main diagonal and Os else 
where:* 


("8 41 f 0.167 -0.0671 H 01 
\fi 1oJ X L- 0 083 0.133 J |_0 lj 

We have already seen that a system of n equations in n unknowns may be 
written in matrix notation as Ax = c. If it were possible to divide matrices, 
we might have an easy solution for the "unknowns” of our problem by divid- 
ing to obtain x =* cl A, remembering, of course, that we were dealing with 
blocks of data, not just single numbers. (If we could divide, the resulting 
column vector x would give all n solutions to a set of n simultaneous linear 
equations in n unknowns.) 

But, unfortunately, matrix division is not defined, because, in general, 
matrix multiplication is not commutative ,* that is, AB ^ BA. (The reader 
should convince himself of this by trying a few simple examples.) 

Because we cannot tell whether x = c/A arose from Ax = c or from 
xA = c, which, in general, would be different, the equivalent of matrix 
division, defined to avoid this pitfall, is called matrix inversion. 

To introduce this topic, we must first define a special square matrix, 
called the identity I, which we have seen before, namely, for the 2X2 case, 

'-Ga 

The identity I acts like a “I” in ordinary arithmetic and is commutative 

AI = /A = A 

Now, we may think of the inverse of a matrix as roughly the same as a 
reciprocal* For example, suppose we can find a matrix B, which when 
multiplied by matrix A gives the identity I. Then B is by definition the in- 
verse of A and is denoted by B = A* 1 


TK, r^„l, .Wn l. o! coarse, to TowuWog ™. In this ease lie exact 

S o >? one deetaal place. The Interested reader may 

? J h , multiplication shown. He wrll d, .cover the effects of minor 

.SSKLSS^T 1 0,11 ““ u ’ n 

Although it fa Always associative, U., ABC = At BC 1 = ( AB 1 C 

A reciprocal is, of course, defined as follows’ if n r- « „ , . . 
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If we can find matrix B, then it must have the following properties: 

AB = AA' 1 = BA =A'*A = I 

which shows that a matrix and its inverse are not only commutative, but 
their product is the identity I. 

If we can find the inverse of the matrix A, then the solution of a set of 
simultaneous linear equations can be obtained directly, because 


Ax = c 
A-'Ax = A~'c 
lx = A~ l c 
x = A~ l c 

Moreover, we can solve the equations by a simple matrix multiplication. 
I or example, returning to the two equations already solved by the detached 
coefficients tableau method, we find that we have already found A for 
the coefficients matrix A, namely. 


j-8 4-1 T 0.167 

-0.0671 

L5 loj L “0.083 

0.133J 


and, according to our argument, the solution for Case I, the constants 400, 
400, may be obtained directly as 


x 


-[ 


0.167 

-0.083 


-0.067") ["4001 ^ ["40") 
0.133 J [.400 J ~ L 20 J 


which checks out upon multiplication of A -1 by the constant column vector. 

To refresh our memory, let us formalize how we obtained the inverse 
of matrix A. To obtain the inverse of a square matrix 9 A, create the tableau 
M|I)> where 7 is an identity matrix of the same order as A. Using the 
canonical procedure , convert the original tableau into the tableau (7|B), 
i.e., into the final solution, if it exists. If the final solution tableau can be 
found— and it can be, if A does indeed have an inverse— then B is the unique 
inverse of A, or A' 1 . 

Why is it worthwhile to go to the trouble to find the inverse of matrix 
A? In addition to the ease of notation and other theoretical benefits de- 
rived from use of the inverse, the repeated solution of large systems of 
linear equations can be obtained most efficiently by this approach. 

Returning to our last example, we observe that because the number of 
columns of the inverse is always equal to the number of rows of the con- 
stant column x, we can treat a number of cases as a number of constant 
columns, form a matrix of them, and solve any number of cases in one 
matrix multiplication. For example, taking the two cases we solved for 


* For simplicity we restrict our attention to square matrices. 
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and imta.bg by dots the many others « might want to investigate, 
we have 

T 0.167 -0.067*] [400 GOO 
1 * L— 0.083 0.133JL400 300 

The solutions for x x and x s appear in the fii 
each column represents a different set of 
combination of electronic computers with 
extends our ability to solve large-scale 
equations. 14 (In fact most matrix applicati 
library programs at electronic computing centers; no programing is re- 
quired to obtain the use of the machine capability. ) 

This survey of matrix topics is sufficient to follow the examples in this 
book. The reader’s attention to Reference 1 and 2 will be of great 
benefit for further study. Other references are cited for supplementary 
reading. 

However, as a final point, we should return to the concept of the matrix 
as a transform, and matrix multiplication (and inversion) as transformations 
resulting from the application of a transform. 

The reader may remember that a function, such as y = 3x, gives us a 
point in xy space for every specified value of x, since to every x there 
corresponds a y. A transformation on the other hand maps a whole set of 
points— a function or set of functions— from one set of dimensions (or co- 
ordinates) into another. Thus, for linear systems of the type we have been 
discussing, the coefficients in the matrices shown are transform coefBcients 
and may be thought of in those terms; the act of applying them is a 
transformation. 

A simple example will suffice. Suppose we have two sets of equations, 
represented by the two matrices of Fig. A-3. In this case we have adopted 
the usual “from-to" convention that inputs shall go from a row to a column, 
which may be seen by a comparison of the equations with the correspond- 
ing matrices. The corresponding block diagram is also shown for reference 
in Fig. A-3. 

The multiplication of A by B maps all points of the output variables y* 
and y, into z, and z„ and the resulting matrix C can be used to write 
equations for z, and z 2 in terms of the original input variables and x,, as 
shown. The single "black box" C now contains the detail of both A and B. 



st and second rows of the result; 
constant conditions. Clearly, the 
procedures such as this greatly 
systems of simultaneous linear 
nns of this type are now standard 
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(b) 


A-3 Matrix Multiplication as a transformation. Two matrices A and B correspond 
to the transforms provided by the boxes A and B of the flow diagram. The multiplica- 
tion of A by B, given in (a), transforms the inputs of A Into the outputs of B, pro- 
ducing the equivalent box C, given in (b). Linear equations, corresponding to the 
matrices are also shown. Matrix inversion may be thought of as the reverse of this 
process. 

Without other information, we cannot recover A and B once the grouping 
has been done. (See Chapter 7.) 

Suppose, however, we posed the inverse question: “How do we get 
back to where we came from?” For example, having found that AB = C, 
we might want to get B back again. To do so we have to perform the in- 
verse of what we have just done— “un-multiply” the matrices (or map back 
from the xz space in which we are now to the original yz space of matrix 
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B.) Clearly the solution to the problem, from our previous 
to perform the operations 

A' 1 AB = A'C 
B = A- J C 


arguments, is 


-s* * • 

given matrix, we find that 


A «= 




where we have kept the fractions to prevent rounding-off. Multiplying the 
inverse by C, we have 


B 


A~'C 



-AH'* 23V T2 n 
*Jll8 34 J Ll 3j 


which is what we wanted to get. Some of the mystery of matrix multiplica- 
tion and inversion can now be removed: Matrix multiplication is a trans- 
formation of dimensions in one direction; matrix inversion reverses the 
transformation process. 

We have discussed here only matrix operations for linear matrices— 
those with constants in their cells. However, matrix operations are not so 
limited: A matrix cell may contain a real or complex variable, or it may 
contain real or complex functions. The basic operations remain the same. 
The interpretation and application of matrices using nonlinear and complex 
entries is. however, left to the suggested references. 
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MATHEMATICAL TRANSFORMATIONS 


A model or system description may be easier to analyze if we transform 
the original model into a mathematically equivalent, yet less cumbersome 
form. We then analyze the transformed model and retransform the results 
into the desired model solution. Figure B-l illustrates the sequence of 
steps in this process. 



B-1 The procedure of using a mathematical transformation to ease analysis. Read- 
ing from left to right and top to bottom, the first step Is to make an abstraction from 
the real world. This model may then be transformed to another, more convenient 
form, then manipulated as desired. Thereafter the procedure Is reversed. The results 
of the manipulation, stated In "transform terms," are converted back to the original 
model format. Finally, the model results are Interpreted In the real world context Any 
form of transformation that is one-to-one 13 suitable. Several possibilities are dis- 
cussed In the text. 


473 







474 Appsndix B 


TOO SIMPLE MATHEMATICAL TRANSFORMATIONS 

A wide range of mathematical transformations tan be used in f W» 

One such transformation, familiar to the student of algebra and tug 
eby is the conversion of xy coordinates into polar or circular coordinates, 
with the origin at St = 0, y = 0, as illustrated in Figure B-2 

Thus, the unique point 1 = 3, y = 4 is also uniquely spccihed b> 
giving the values 

r ~ Vx r + ?, 6 = tan ' 1 vi T > 


so that die point x, y may also be given by r and the angle 6, or simply 
r /_ 6, here 5 L 53°, and $ is measured from the right-hand x axis. 

Conversely, given the circular notation r L 9, we can go bach to the xy 
coordinates by using the fact that 


x = rcos0, y — rsm0, 

which permits us to transform the xy coordinates of a point to circular 
coordinates and then bach to xy coordinates. 

Note also that the above transformation (and its inverse procedure) 
will in fact transform or map all points in the xy coordinate system into 
the polar or circular coordinate system on a one-to-one basis (and that 
the inverse procedure will in the same way map uniquely all points in the 
circular coordinate system back into the xy coordinate system.) 

This is the distinction between a mathematical function and a mathema- 
tical transformation: The function, say y = f(x), gives us specific values 
of the dependent variable y for given values of the independent variable x; 


y 



'‘” ,,mon mathematical traralomiat.cn I, mat 
U«ft!re Its novi'.icn hv »»,« C00f ® ,na,B * of a point to polar coordinates, which 

the sntj'.e 0 mealed hem 0x1? oV ThT UV? 81 ' 1 ? 10 the Q0 ' nt to S eUl * r with 

****** at Will, as Indicated in th?? e £ ,f3r »*°nnatlen Is unique and may be 
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the mathematical transformation, on the other hand, transforms all func- 
tions in one set of coordinates into unigue functions in another set of 
coordinates. In short, functions assign numerical values to numerical 
values; transformations, sometimes called operators, assign functions to 
functions. 

Example: An interesting transformation that converts lines to points 
(and vice versa) forms the basis for a recent patent (Patent No. 
3,069,654, P.V.C. Hough, “Method and Means for Recognizing Complex 
Patterns," Dec, 18, 1962). Suppose vve have a straight line in the xy 
plane, which intersects the x and tj axes at points u and o, respectively. 
One possible line/point transformation is to define a new set of axes, 
say s and t, and to let a point in st space be given by the line in xy 
space by using the intercepts u and v to compute 



as shown in Figure B-3(a). In this way, every possible line in the xy 
space will be mapped into a unique point in the st space. Similarly, 
every point in st space can he converted into a line in xy space by 
noting that 


s 

u = s; «> = -» 

where, again, u and v are the intercepts of the line at the x and y axes, 
respectively. The alternate displays may point up patterns in experi- 
mental data, because if a number of points in st space should fall into 
a straight-line pattern, they will be transformed into a series of con- 
current lines (crossing at about the same point in xy space). Con- 
versely, approximately concurrent lines in xy space will map into ap- 
proximately colinear points in st space.’ In various applications, con- 
version from one form of display to the other may aid the analyst in 
“seeking" an otherwise hazy pattern. For example, the Hough patent 
considers the detection of particle tracks in bubble chamber photo- 
graphs (an experiment performed in physics). Photographs of bubbles 
gave points in st space. The objective was to find if there were any 
“tracks” or trajectories present that would indicate the presence of par- 
ticles shooting through the bubbles. By converting the points to lines 
and looking for coincident lines in xy space, lines crossing near the 

In particular. If a series of points in st space should fall In a line given by + ml. 
then the intercepts of the corresponding hues in xy space are if, f/(b -f- o»t )J, and 
each of these lines will pass through the common point in xy space given by (— b/m, 
1/m). 



476 Appendix 



/^orcuoeot 

1 / 

i 

es 


f Colinear points 

4 

/ 


mSmSm 




10 20 30 40 50 60 


B-3 (a) Ttie line-point transformation, (b) Radar bearings as lines, (c) Radar bear- 
ings as points. As discussed In the text, the Intercepts of a line produce two numeri- 
cal values that can be used to locate a unique point in "point space,” as shown In 
(a). An application discussed In the text Is illustrated by (b) and (c). (Source: Airborne 
Instruments Laboratory, Division ot Cutler-Hammer.) 


same point indicated such possible tracks in the bubble picture. In 
another application, suggested by Airborne Instruments Laboratory. 
(IEEE Spectrum, August, 1963, p. 5), the reverse procedure was used 
to resolve somewhat conflicting bearings taken by radio location equip- 
ment Figure B-3b shows the bearings, plotted as lines in xy space, with 
the intercepts for each line being u and e, respectively, as before. The 
picture looks confusing. However, when the lines are transformed into 
points in st space, the points appear roughly colinear, and a line may 
be fitted to them, as shown in Figure B-3e. Reversing the procedure, if 
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we map the line in st space back into the xy space, it will appear as a 
point, denoted in Figure B-3b by a large "X”. It is interesting to note 
that we would probably not have selected the position of the “X” mark 
as shown, had we attempted to do so by inspection of Figure B-3a alone. 

The usefulness and application of mathematical transformations, such 
as the one illustrated in the example, is limited only by the ingenuity of 
the analyst, and so it is frequently useful to imagine other equivalent 
ways in which a system description may be displayed. 


MORE COMPLEX MATHEMATICAL TRANSFORMATIONS 

Two well-known transforms, the Laplace transform and the a transform, 
are widely used in the analysis of systems that can be described by inear 
differential equations and difference equations, respectively. Applying the 
Laplace transform to a differential equation, or a * transform to a difference 
equation, converts these more complicated expressions into an algebraic 
form that can be handled more easily. The Laplace transform and the z 
transform, as we shall see, allow the analyst to compute the output response 
of a Unear system to inputs that are, respectively, continuous and periodic. 
These transforms greatly simplify the analyses of linear systems that exhibit 
lag and feedback. Without the use of the Laplace or the z transform in 
these cases, it is usually very difficult to obtain an equation expressing the 
— • • * - • - « s-t i.e., without recourse to numerical 


output from the system in “closed” form, i 

evaluation of specific input values. , _ . ^ j 

Although, fortunately, we may describe how to use t e P a 
z transforms compactly, a description of why these trans orms «or 
the basis of their development is unfortunately beyon t e scope 
appendix. A complete understanding of the theory o ese ^ ’ 

moreover, requires the reader to be familiar with the su iec , 
“functions of a complex variable,” which is usually somew a y 
mathematical level currently available to most nonma ema 1 
references at the end of this appendix are for those w o wis - anorec ia- 

background on what is to follow here, which will be uni e reader 

tion” of the subject. However, we will assume hereafter that the reader 
has had at least a first course in calculus, because this wil 
for the discussion. 


The z Transform 

To illustrate the 5 transform, we will first show how to obtain the ~ ^trans- 
form of a function, then how to find the transfer function of a s> 
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transform terms, and finally how to obtain the output of a system using the 
previous two z transforms. 

Suppose the input to a system varies with time, 1 but occurs periodically, 
at equally spaced intervals, say minutes, hours, weeks, or months. To indi- 
ca e twtencss, we will call the input f(n) and denote the successive 
H„, L°n mpat fUOCti0 ' 1 bY a " fl - 4 » that, in general, 

5 — * -- 

^ u t D 4-T;vLrco o „ r ri ,u m of n ,ks ““ «• ;i; is 

linearly, so that //„} - !I ' 6 “1 L“ ' vI ” ch 1,1 e input increases 
a. = <.; = c, ~l= ’ “ , Sf, °"? , in B-4b- In the first case 

Cj = 1 a, = 3 ' ■ "■ and m the fetter case a 0 ~ 0, a, ~ 1, 

any arbitrary value norithe " genm '' , ’ 1<! . !u f c ^ s!iv e o's can take on 
/(0) = a,, /(l) =m, /(2) n, /ilf- Ve ' 11 i”" 8 understood that 
value of the Input at a riven !0 m io indicate the 

a formula for providing * as ° T T* ° faterest - «» devise 
«n) = n . ”8 “e ns, as ut (he above ease when f(n) = 1 and 

tv-* 


«•) . 


"- + ‘’,r + 0tf< + <w . + Oi2< + 


• I ««*- T ... ~ 22 /(n)z" 

F(e), the thB function, and 

and rice versa, jus, ^^""“ “‘‘lue-we can go from f(n) to 

rzr d wpo " 

Of fun «5on I^y be d U i£^^ lhat a sped- 

iWe * ,ve ''equities required 
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m 



-2 -1 0 1 2 3 4 5 


(b) 

S*4 Figures illustrating the notation for discrete functions, (a) A discrete step 
function, (b) A discrete ramp function. The function f(n) takes on values only at 

equally spaced intervals of time or in the sequence numbered n = 0, 1, 2, 3 

he function t(n) Is 0 for all negative values of n. 

yet obvious). Values of F(z) will be limited provided that /(n) does not 
increase faster than geometrically.* 

Some engineering and mathematical texts (such as Kaplan, Reference 5) define the Z 
transform as 

HZ) - f ;/(nk- 
o 

where the script Z is most frequently used to indicate the negative exponentiation 
Although the transform tables developed for the alternate definitions will he 
different, the same results are achieved by the consistent application of either. Whether 
or not the series defining F(z) [or F(c)] converges is a question to be raised in 
either case. In most cases of interest F(z) will converge if Izl < 1, and Ftz) will 
converge if |*| > I. Hie reader should obviously take care to determine which of the 
definitions above is used fn a given reference source before computations are made. 
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gam* Applying this treatment to the W *P* *»«* to ** ^ 
vious example, we have 

Flz) + 

Remembering that the sum of a geometric scries is 1/(1 
is less than unity, or by cross-multiplication, we obtain 

1 

F(z) = l + Z + ^ + 2 * + *” 3:1 1 — 2 

We have just found Hz) = 1/(1 - *). *« * tra " s ' oml o! > ln) ’ Whe " 
f( n t =*■ 1. a "step” function. „ 

Continuing in the same way, we find the z transform for J(n) — , 
“ramp" function, namely. 


F(z) = 0*1 + 1 • z + 2 • z* + 3 • s 

= z(l + 2z + 3z* + • • •) 
d 


l + •• 


dz 


(1 + 2 + Z* + 2* + • • ') = 


dz 


-- f>z" 


(r^V 


where we recognize that the series (1 + 2z +3z* + * * • ) is the firs 
derivative of (1 + z + z 7 + s 3 + - • • ), the sum of which is 1/{1 - *)• 
Taking the derivative of 1/(1 — z), tve end up with 


m ■■ 


' Cl - *Y 

which is the z transform of a ramp, /(n) ® n. 

If we continue in this way for a number of cases, we can construct a 
table of functions f(n) and their corresponding z transforms, F(z), 
such as Table B.l. 


It is particularly important to note from the transform pair table. Table 
B.l, that shifting the origin of the time axis one unit to the left, and in effect 
“delaying” the input one unit, is equivalent to replacing /(n) by f(n — 1). 
Moreover, if f(n) is replaced by f(n — 1) and F(z) is the z transform of 
/(nf, then the z transform of f(n — 1) is zF(z). Thus, we can indicate a 
delay of one period by multiplying the z transform of the function by %• 
Similarly, we can indicate a delay of two periods by multiplying by z*; in 
general, we can indicate a delay of r periods by multiplying the z transform 
by s'. (The procedure reverses itself for an advance, or a shift of the time 
origin to the tight. To advance the input by one time period, we divide tbe 
z transform by z, and to advance r time periods, we divide by z T .}* 


,e ’» ■ j6 °’ then tbe value of f( 0 ) must be subtracted 

irom J-U) fadwe tbe division operation, as shown in transform pair (5) of Table B.t. 
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Table B.1* r Transforms 


Pair 

Function 

Transform 

T t 

m 

F(i) - E/(n)z- 

T t 

o/.w + «w 

aFiiz) + bF t (z) 

T, 

/(» - 1) 


T\ 

l(n + 1) 

o 

1 

ST 

T t 

1 (» ** 0) \ unit 

0 otherwise f impulse 

1 

T t 

1 (n > 0) unit step 

a - 2 ) 

T-, 

n (n>0) unitramp 

z 

(1 - *)* 

T, 

o- 

1 

(I - az) 

T, 

na" 

az 

(1 - az)' 

Tio 

a*/(n) 

F(az) 

T n 

(-!)• 

(1 + 0 


’Assumptions: /(n) = 0 for all n < 0. /(n) can take on values only at equally spaced 
intervals numbered n = 0,1,2, 3 In the above table a and b are constants 

Example: We know from the above example that the ~ transform of 
f(n) = 1 for n ^ 0, and /(») = 0 otherwise, is F(z) = 1/(1 — z). 
We argue from the table of transform pairs that the z transform for 
f(n) =1, n^sl, and f(n) =0, otherwise (which is equivalent to the 
original function with n replaced by n — 1), should be zF(z), where 
E (z) is the z transform already found. This may be shown by direct 
application of the definition. For the new function, 

For/(n) r F(z) -l‘l+l-« + l- c*+l. «*+••’>- 
For/(n - l): F(«) = 0 • 1 + 1 • * + 1 ■ *• + 1 • *» + • • • 

- z(l + J + 2’ + 2* + • • ) - YZTz 

That division by z will produce the original transformation, or “advance" 
the input back to the original position, is obvious from the result. 

We have shown how to find the z transform of simple discrete functions, 
how to advance and delay these functions in time by the simple process 
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of dividing or multiplying the * transform by z for each period of advance 
CkarivT""^' "' e haVe deVd0I ’ ed 3 tab,e of hansfonn pairs. 

al^taTIn I" = *T 

J21T SSE& Mi ~ 

that appeared in his table of tnm ^ P rov “kd him with a z transform 
down the function f(n) associated witlfT^ cou,d Mediately write 
oho appear in his table' By £? fimrti ° n "™ ,d 
inverse, /(n). e the z transform to its 

* “u“rt“^^' v rf < v sysi ™ - d -,o P 

case in svhich the ‘ 

y(*r) « az(n) + (1 — a)y( n — j j 

5^«4.taSS«’^ -1) b *« output of the 
2? h familiar from paeeTo" “ * (« <«< D- 

Of the output y(n) to the input x/n) t,!,? , pu , r ? ose is to find the ratio 

tt dV; ?" d ? e 1 tatufo™ o? to i abo assume ,hai "■<= 

tot rf (l > i, a,, . ^ •“ Wm output Y(«). Remembering 

* " e ^ n °;~ ° f 

unction of the system as w ® get the required 

Z&_ 

An ^(*) l _ - 

«e fe « U °No"e d ««todX to'dr 11 

hy multiplying b _ f o indicate a equation on 

end to obtain the tm f ' l US t0 (x| llict all the te Pen ° d delay in time 
original express^ Z^U^T «*» *£ “ *<*>, “ d »(«) 

«■») and p(„ _ ,j ““ ld not have collected « kg. In the 

1) «-ould n„, have been the tm e " “ ^cause 
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r j^. nOW corne t0 P^ ase Ae analysis. Suppose we want to 

tnd the output from the system described by the transfer function 
i (z)/X(z). How can we do this? 

Given a speciBc input function x(n) and therefore X(z), which may 
be found from it, we may obtain Y(z) as* 


Y(z) = X(z) 


m 

X{z) 


Example: Suppose the input to the system is x(n) = 1 for n^O, and 
^(n) =0 otherwise, which is the "step” function previously discussed. 
Then the output of the system— in 2 -transform terms— is given by 


r <^rh[rz^] 

Similarly, suppose the input to the system is x(n) = n for n^O, and 
x(n) =o otherwise, which is the "ramp” function previously discussed. 
Then the output of the system is given by 


K<i) “ (1 - (1 - tt ) z ] 

[All the examples given here assume system linearity, so that the transfer 
function Y(z)/X(z) is independent of the input to the system, and may 
be tested for any desired input function X(z) without modification-] The 
function Y(z) gives the total picture of the system output, given the speci- 
fied system input, but does so in terms of the "operator” z. What remains 
is to convert from Y(z) to t/(n), which requires that we find the inverse 
transform of Y(z). 

The reader may now realize that the complete output specification of 
the system is available through his knowledge of Y (z), yet he may also 
realize that the inverse transformation y(n) is not immediately obvious, 
because in neither above case does the z transform obtained (and therefore 
its inverse) appear in Table B.l. This is the usual situation. 

However, if it were possible to break up the more complicated z trans- 
form representing the system output into a sum of simpler terms which did 
appear in the table, we could get the final result, the inverse of the z trans- 
form Output, by inspecting the table. 

The technique usually used for this purpose is called the “method of 
partial fractions,” and is described at length in References 4 and 7. How- 
ever, a short example will suffice here. 


‘The multiplication shown is not always correct, but will be so for the usual cas 
which the transformation is made at the discrete intervals of time n=0. 1, 2, 3, 
and not at others. For more complex cases, see Reference 4 
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° S ™ '"“'“rVng 'he a transform by a for each period of advance 
and " e . h ™ devel °P“l » ‘aMe of transform pairs. 

time sSl .“ ad n ’ a|M 'TT °' lhe 1 toMfonn b a shift in the 
toe Kale-an adsance or a delay of an input function-may be aceom- 

m'^TI.To^ the , “ has b6 “ converted^o £ Z 
would eliminate much f tb* ° * ,3rge ta ^ e of pairs 

cause ;„„Z tc Zr ° f ,ransfo ™ P a “ development, be- 
repeat this work. Finally hen ** /l” wri . tlen ^ 0vvn > there is no need to 
the reader should realize that i?!° 6 un5gueness of the transform pairs, 

that appeared In his table^of ^ a 3 ^ransfon^ 

dowm the function /(n) associated COuIt ! Mediately write 

also appear in his table. By this menn« . ’ ^ ecause that Unction would 
inverse, /{n). " e 8° from the z transform to its 
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5 ! nation is familiar f rom pa ce 47 “ “ “ “"s' 3 ”* (° <« < 1J 

ofthe output y(n) to the input wm rf P “? 0!e h "> &d the rati 

"C hm, boTl.'fc £ WeZ^A * 6 6 tra “ f6nn « 

’■ ’ ve TOy “7 ~ • 

P(c)=oX(:) + !1 


naion ot the system as W/*(s), we get the require! 


11 *)^ 
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,h6 — [ °" 
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We now come to the final phase of the analysis. Suppose we want to 
find the output from the system described by the transfer function 
Y(z)/X(z). How can we do this? 

Given a specific input function x (n) and therefore X(z), which may 
be found from it, we may obtain Y(z) as 5 


Y(z) - X(z) ■ 


m 

X(z ) 


Example: Suppose the input to the system is x(n) = 1 for n^O, and 
x(n) =0 otherwise, which is the “step” function previously discussed. 
Then the output of the system— in z-transform terms— is given by 


Similarly, suppose the input to the system is x(n) — n for n — 0. and 
x(n) =0 otherwise, which is the “ramp” function previously discussed, 
Then the output of the system is given by 


Y(fi ■ 


(1 


j_ C— ^ — 1 
- zy Ll - (1 - a)d 


[All the examples given here assume system linearity, so that the transfer 
function Y(z)/X(z) is independent of the input to the system, and may 
be tested for any desired input function X(z) without modification.] The 
function Y(z) gives the total picture of the system output, given the speci- 
fied system input, but does so in terms of the “operator” z. What remains 
is to convert from Y(z) to y(n), which requires that we find the inverse 
transform of Y(z). , c ,. r 

The reader may now realize that the complete output specification of 
the system is available through his knowledge of l(z). yet e may a 
realize that the inverse transformation »(n) is not immediately obuous, 
because in neither above case does the z transform o fa e (an ere 
its inverse) appear in Table B.l. This is the usual situation. 

However, if it were possible to break up the more complicated , mn* 
form representing the system output into a sum of ampler termwhleh^ 
appear to the table, we could get the final result, the inverse of the . trans 

partial fractions," and is described at length m References 4 and 7. How 
ever, a short example will suffice here. 


, , . , correct but will be so for the usual case in 

The multiplication shown is not al*ay Inters als of time n = 0. 1, 2, 3, . . . 

which the transformation is made at 4 

and not at others. For more complex cases, 
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By the method of partial fractions w#* tn c„j r « , _ 

fractions which is identically e„u Z 1 ", t£ “ * f 
that is to be broken m Tim S ’ mDre c ° ra pk*. fraction 

the faction to be bS J To ™ kT T/ ” Ae d ™«»b»tor of 
terms, then that facfa„ ro ^ b et^lT^ «*-*» 

which has as its denominator oneTthe 1" e T r° f ' aCti ° nS each of 

different treatment is required whence facto ^ faC *° rS ' A somewhat 

methods are illustrated in the following example/ repe3,ed 71,6 t ' m 

Example: In the 2 transform 


Y(?) 


»~~r — i 1 


* * L‘ - (l - 0)zj 

telff --^.nrts, namely, 

iora vo c a,,rac,io ” ,c ™ *• « -*»s- 


PM 






enforcing the'fe,u/u'erapi tJ lhe ” Umera,ors A an d B are found by 
2£- for /. multiply bother* f0r A anti 8 i5 as follows; 

3nd ='■ Tltis Immediately ”eMsV ' de "Ommator of A, (1 -*>, 


«»* si des b y f . 

w 8 ~ -(I — a) 

We may now write 


(1-fl) 


nr)- 




* ge ‘hort^it method *f * ^ * C ° nStant not aff< 

1 "“Aerators exce 
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result °f the inversion). By inspection, then, we find from Table B 1 
that the inverse of Y(z) for the step-input case gives 

y(n) = 1 - (1 - a) (I - = 1 - (I - 0 )..i 

Srfi nt , e h rPrela ! i0n 'f r “ ult is lhat 11,6 &sl leral is "steady-state" 
seoon T, 6 ^ T I™ 15 ° ff l ° the va,ue of the s,e P input— and the 
econd term is the transient" term that dies out with time. This result 

wort?? P e n sp f cificati0n ol the system output, given the system input, 
orks for all values of (O^a < 1) and for all possible step functions, 
ecause if x{n) = A • 1, where A is a constant scaling factor, then y(n) 
given simply by Ay(n), a constant times the result already obtained. 

1 timing to the case of the ramp input to the system, we find that we 
nave repeated factors in the denominator of the fraction to be broken 
up, namely, the factor (1 - z)* = (1 -z)(l - z). In cases of this sort, 
an expansion of the form below is used: 


Y(z) ~ — £ f_ a 1 Az , B , C 

(1 - 2)*Ll - (1 - a)z J (1 - z)* + (1 - z) + l - (l - a )z 
That is, the repeated factor is repeated in the expansion in ever-decreas- 
ing powers and with decreasing powers of z in the numerator. This form 
ot expansion usually produces transforms that can be found in Table 
j • A S ain , we solve for unknowns, A, B, and C, as before. To solve 
*. 0r _ A * multiply both sides by (1 - z)* and set z = 1, which gives 
~~ (1 —a)] = 1. To solve for C, multiply both sides by 
~~ a ) z an d set z = 1/(1 - a), which gives (after clearing frac- 
tions) C — (1 a)/a. Finally, if we set z = 0 on both sides of the 

partial fraction expansion, we get B = -~C or B = — (1 — a) /a, so that 


z 1 - a 


r 1 l 

(1 - z)> a 

ll - */ a 

.1 - (1 - a)t J 


is the completed expansion in partial fractions. 

Each of the terms in the expansion may now be found in Table B.l, 
with the corresponding inverse terms 


y(n) = n - + (1 — a)". 

a a 

The interpretation of this solution is that the system follows the ramp 
input (term I), but never quite catches up by the amount shown (in 
term 2), and also shows a transient at the start, which dies out with 
time (term 3). All of this is available from the closed formula repre- 
senting the output of the system in response to a ramp input. 

The examples shown do not exhaust the manipulations that may be 
necessary to find the inverse of a z transform. Indeed, the art of using 
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these transforms lies in the analyst's ability to break up a cumbersome 

j-StoisSj’Jfc.r T ic r e,l ’ ausiiv,: •»»*» too the 

tag for ihe r dc . r sh0uld >>-0 a feel, 

the importance of being able to handle' la^a T'!? a " d hc shou,d roalizc 

function with algebraie^ase Note 1, £ 1 <W,C ,imc 

tion in the analysis of camnUrt l “ r,ms ^ orm finds 'vide applies- 

analysis of <> "* ” «»» 

— ? — «* - 

for a wide range of possible svstrm m cI °sed erpression is desired 

for a specific system input and meeifie^T m ' ! ' T ° deve,0 P *0 output 

evaluation of the difference ecmation f ;,l!m Parameters, direct numerical 
wnk. In any case, it fa usua% worthwhile Sy!,em ma >' bc ,e!s 

by comparison with the first few , e ™f „f Wk 'a' 6 Mranst °™ results 

from the original difference equation, f " SpMific SD ’ u,ion computed 


The Laplace Transform 

by 11,6 up,a “ «•— 

^ on 

equivalent j^ ^ o'hcrwiw, shown ta 3? = '■ tm 

1 may “• - * r z us n f r F 2 ^ s £ 

' natty th ‘ ,US ' " llegral yalucs- 
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B*S Figures illustrating the notation for continuous functions, (a) A continuous step 
'unction, (b) A continuous ramp function. The function t(t) takes on values at all 
values of the independent variable /. The function f{t) is 0 for all negative values of t. 


Now consider what would happen if we carried out the following inte- 
gration, which converts /(f) into a new function F(s) defining the unique 
Laplace transform: 

m - I'fWe-dt 

Example: Using the two former examples of the step and the ramp 
inputs, but treating them as continuous inputs rather than discrete in- 
puts, we may find the Laplace transform of each. Thus, for the step 
input, f(t) = 1 , for f 0, and f(t) = 0, otherwise, we have 



and, for the ramp input, f(t) = t, for t 0, and f(t) =0, otherwise, we 
have 
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F (*) ~ j te~“ dt, 

which, upon integration by parts, 8 gives 

may “ • 

Tabie B.2 Laplace Tramlorms' 
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Note from the table, that differentiating a function of t multiplies the 
Laplace transform equivalent by s. That is, if F(s) is the Laplace transform 
of /(f), then sF(s) is the Laplace transform of df(t)/dt. 9 Conversely, if 
F(s) is the Laplace transform of f(t), then (l/s)F(s) is the Laplace trans- 
form of the integral of /(0» /(0 

Example: Using the step function, f(t) — 1, f — 0, as an example, let 
us find the Laplace transform of £ f(t) dt. In this case jf f(t) dt = 
j a dt — t. The Laplace transform of /(f) ~ t, from Table B.2, is 
F(s) = I/s 2 . But, also from Table B.2, the Laplace transform of /(f) = 
1/s. Thus, if F(s ) = 1/s is the Laplace transform of f(t) = 1, then 
(I/s)(F(s) = 1/s 3 is the Laplace transform of j ( Q ‘f(t) dt = f. The re- 
verse procedure also holds. If /(f) = f, then df(t)/dt = 1, and the corre- 
sponding Laplace transforms, from the table, are 1/s 2 and 1/s, respec- 
tively. Thus, differentiation corresponds to multiplication by s. 

With this information available, we may now proceed as before to 
develop the transfer function in Laplace transform terms for a system that 
can be described by a differential equation. 

Example: Suppose we have a control system which operates continu- 
ously according to the following equation, where a and b are constants 
and | i/(f) and x(f ) are the output and input, respectively: 

3/(0 =<■*(<)-& (^f)’ 

Assuming that we can find the Laplace transform of i(t), from Table 
B.2 or by the definition, and, therefore, obtain X(s), and that we will 
be able to find the output Y(s) as a result of the analysis, we may con- 
vert the differential equation into the equivalent Laplace transform 
relationship 

!’(,) = uX(») -bsY(s), 

if we remember that differentiation of a time function is equivalent to 
multiplication of its Laplace transformation by s. Collectmg terms in 
T(s) and X(s) and solving for T(i)/X(i). we obtam 

I'M 

X(t) 1 + bs 

which is the transfer function of the system, expressed in Laplace trans- 
form terms. 


•When 0 [or t = 0, i.e . when /(«> '!? ««) mart bo rule 

traded from F(.l, as shown to transform pair ( .). Tabic B„. Tie same holds for 
higher order derfvoUveS at t = ■■ *»"» fa (T.). 
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This transfer function (in s) way now be subjected to various input 
functions expressed in Laplace transform terms. 

Example: If a ~ 1 and b = 4 in the above expression, show that the 
output from the system will he y(t) =t~ 4 + 4<r>« when the input 

1 * r I*' T ° tr lhi! • "' C Bnd Y(s) X(s) by the 

transfer fanction Y(s)/X(s). The correct value of X(s) for a ramp input 
ts, from Table B.2, l/s>. The value of Y{.) is, therefore. 


r»- 


s’ 1 + hi 


1 +ll ' 


-+5 + -£_ 

a* • 1 + is 


weStUsI^ ° f Plrtii “ 1 f r‘ i0n! ' “ dKcribed lDr * transform, 
we break this fraction up mto the three fractions 

m - 

and solve for the unknown numerators as before to obtain 
A - 1 , B = —4 c „ +16 

so that 

Y(>) “ 5 - 4 0 + 4 (-i_A 

y(<) = t — 4 + 4e- ,/4 
The output follows the innut in i 

corntan, lag of 4 units (tern, 2, and ad^ng^™ \\^ m h 2 

we wodd’w obuL“d'tfTe P ^tem £7^’' '** *■' wUd > 
difference equation ^ ‘ m had b «n described by the 

it(n)=«(n > -df^n) 

^rentiaM^tfuTr^h p,^. “ ’’ b “ ed 10 Resent the 


vWHi + 
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If C = 1 and d = 4, we have c/(I + d) = 0.2, d/( 1 + d) = 0.8, and 
For a ramp input, as before. 


no- 


z 0.2 

(1 - z)* 1 - 0.8 z 


Az B 
(1 - zy + 1 - z 


+ 


c 

1 - 0.8z * 


Using the method of partial fractions, we find A = 1, B = 4, and C — 4, 
so that 


y(n) = n — 4 + 4(0.8)". 

This result compares directly to the result obtained for the differential 
equation, and gives very nearly the same numerical result, because 
(0.8)" and e~ tM approximate each other closely. 


torn 

e~ ,/4 

(0-8)" 

0 

1.000 

1.000 

1 

0.779 

0.8000 

2 

0.607 

O.WO 

3 

0.472 

0.512 

4 

0.368 

0.410 

5 

0.287 

0.328 

The discrete approximation to 

the continuous case could be improved 


by decreasing the interval of measurement for f(n), assuming the 
transformation of the system could also take place at those more fre- 
quent points. A discussion of this extension, however, must be left to 
the references. 


SOME COMMENTS ON STABILITY CRITERIA 

Suppose that a system receives a "shock” or an impulse. Then, if that 
system is stable, we would assume it would settle down to its initial position, 
and not assume a new condition. This stable system is comparable to the 
weighted doll often used by children, which, when tilted, reverts to its 
former position, although it may wobble back and forth before coming to 
rest. Mathematically, a similar criterion may be established. For example, 
if a discrete time system receives an impulse, we may require that the sum 
of the output pulses emerging from it be bounded, or have some limit. In 
other words, the original impulse will eventually die out, and will not 
continue, or increase in amplitude as time passes. 
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SOME PRINCIPLES OF CODING 
FOR COMMERCIAL USE 


Out purpose here is to Illustrate a few forms of coding used in industrial 
and commercial practice. We Will not consider codes used {or secrecy ot 
transmission, as in Reference 6, nor codes from an information-theory view- 
point, as in Appendix D, nor as a statistical problem, as in References 3, 
4, and 5. 


CODE SELECTION 

In most commercial coding applications it is desirable to have a unique 
representation for an item, event, action, idea, or concept which may be 
described in. many ways, For example, a product code or employee identifi- 
cation number is designed to identify one and only one product or 
employee. 

However, the selection of the particular code that will be used, even 
though unique, offers a classic example in economic trade-offs. On the one 
hand, some capacity or investment in facilities is required to transmit, 
process, and manipulate codes. On the other hand, the capacity required 
in each of these phases of processing is affected by the choice of code 
format. In general, the shorter the code, the smaller the cost of transmis- 
sion, storage, data entry, sorting, and human handling. The longer code, 
however, requires less translation and “look-up" capacity and processing, 
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and often provides greater versatility in data extraction, statistical analysis, 
ana category combination. 

As a result, in selecting a given code format, in terms of the number 
of characters employed, the code structure, and the character set used, the 
over-all System of code use must be considered. 

Example: Consider two airline reservation systems. In the first, originally 
used by United Airlines and designed by Teleregister Coip., a 20- 
character code was used. This system identified flight segment, date, 
and seat available for sale. In this nationwide system, with central pro- 
cessing at Denver, Colorado, and data entry through agent sets at air- 
ports and reservation offices, a 2-second response time from data entry 
to reply was obtained. However, restricting the code to 20 characters 
for fast transmission and processing did not allow storage of customer 
detail, which was often desirable. 

In a second system, designed by IBM and known as the SABRE 
system, a 250-character code was used. This system, originally installed 
by American Airlines, included customer detail, as well as the usual 
reservation information. Although the SABRE system provided much 
greater flexibility of statistical analysis, control of reservations in detail, 
and many side advantages, it was millions of dollars more costly than 
the simpler Teleregister design. To obtain equivalent response time, 
processing and communication facilities had to be much faster, memory 
capacity much more extensive, and switching and queuing problems in 
the communication network were much more severe. Moreover, the 
agent's sets became much more complex and expensive. 

The choice of one system over the other was thus an economic prob- 
lem. The long-range economic value of the more extensive abilities 
provided by the longer code structure and the more expensive processing 
facilities of the SABRE system involved computations of the extra or 
marginal revenue that could be generated through tighter control and 
greater utilization of the seat-mile capacity of jet fleets. To obtain a cost- 
effectiveness evaluation of alternatives, the total system of information 
processing and flight capacity utilization had to be considered. 

In this sense, the problem of code selection is not only a technical 
problem but one of system economics, and coding "efficiency'' from an 
economic viewpoint is seldom the same as the statistical efficiency of 
coding, as defined in Appendix D. Statistical efficiency of coding implies 
efficiency in transmission without regard to use. The economic efficiency 
of a code is largely dependent upon the uses to which the code will be 
put in a larger system context. 

In practice, the selection of a code structure usually involves an 
evaluation of a' number of alternatives originally proposed for a specified 



433 Appendix C 

The danger with this procedure is that -«**£"* 
may not be uniquely described by the block now chosen, but may duph 

^ ^ Twcfstandacd variants between the serial and block code extremes fre- 
tjueotly are used to obtain the advantage of compactness and some ease o 
extraction and interpretation- . Wff _ 

First, we may block the serial code if the item list is not likely to change 
in length or contents. We do this by sequencing the items listed in some 
meaningful order before coding. Thus, suppose the employee names are 
first alphabetically sorted tcithin given plant locations, in the sequence 
New York, Chicago, Los Angeles. Then, when reading or extracting the 
serial code, we know that all employees with a serial number greater than 
“3” work in Los Angeles, 

Unfortunately, this code format becomes obsolete if a change in the 
item list occurs. This complication can be alleviated by reserving "empty 
spaces in the original assignment of serial codes, but, because it is difficult 
to predict where items will be added or deleted in the future, such "tight 
coding is an undesirable alternative for lists that are apt to change. 

The second alternative is to serialize the block code, the procedure 
used in the Dewey Decimal System employed in libraries. In this approach 
we block the major categories of interest and cany a trailing serial number 
to provide unique identification. In our example, we could use a code 
structure 

Position Category' 


Plant location 
Marital status 
Serial 


e.g., ADAMS could be called #311, indicating he lives In Los Angeles, is 
married, and has the arbitrary serial number “l” for that category com- 
bination. 

The serialized block code may be considered the most popular of the 
wen-known formats because updating and file maintenance are easy. To 
add or delete an item we adjust the trailing serial number. The major cate- 
r d0Wn ? remain unc Wed. As conditions change, our 
beCOnW obW,c!e and Iet J*rire a complete 
renumbering or recoding of the hies with which we work 

of A.'nSbS' COde " fS'aai'y eao be increased by a careful analysis 
of Ac needed categories, the symbols used for coding and the methods 
of file operation and data- handling to be used. 

Ac Tf e ' 1 °° ! ? ,g ? °" r ciam P Ie mi sht suggest combining 

Ac categories plant location" and "marital status" into one decimal posf- 
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tion when he sees a total of nine characters (less than the ten decimals 
available) would do the job: 


1 New York Married 

2 New York Single 

3 New York Divorced, etc. 

4 Chicago Married 

5 Chicago Single 

6 Chicago Divorced, etc. 

7 Los Angeles Married 

8 Los Angeles Single 

9 Los Angeles Divorced, etc. 

Here we have serialized within a block to achieve a shorter code with 
only a small sacrifice in analytical ease. 

Finally, we should note that symbols other than decimals can obviously 
be used. Alphabetic characters would give 26 subclassifications for a given 
code position, so that alphabetic and alpha-numeric codes may be desirable 
if they can be handled by the data-processing facilities to be used. In 
general, the maximum number of distinct codes obtainable with a combina- 
tion of character types will be the product of the characters available at 
each code position. (With block coding and its variations, the usable com- 
binations will usually be less than the maximum figure.) 

It is interesting to note that block code length can be as short as a 
serial code if each and every available combination in the block code is 
used. For example, we may have the following items for which only two 
characters are available per position (binary code), i.e., 0 and 1. We Have 
the following items on our list. 

SERIAL BLOCK 


A married female 
A married male 
A single female 
A single male 

As shown in the table, the serial code can 


00 00 

01 01 

10 10 

11 11 

be made equivalent to the block 


code and vice versa. 


HIERARCHIES IN CODE STRUCTURE 
So that items may be identified, handled and 

oi rr ,i|*it - " -SET "hy necd^not neccTsarily 
form descnbed m d characters in a code can always be 

be positional, since ^“frequently lenient to or S anizc a code from 
re-ordered. However, it is ircquem*> 
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left to right, so that the left-most character is the most general class, and 
positions to the right represent inferior subclasses. The postal ZIP Code 
and the bank code used to sort cheeks are cramples. The five-digit ZIP 
code gives from left to right the postal division of the country, the distribu- 
tion renter within the postal division, and the local post office within the 
distribution center classification. Similarly, the bank code gives the Federal 
Reserve drstrrct. Bank identification. Branch, and Customer Account Num- 

^hTbo.^X 2ck erarChy °' imP ° rtanCC readi " S fr ° m ' Cft ,0 riBht 

m ‘ d in °' her ,ami,iar applications. Thus, maga- 
ame mailing hsts, which run rnto millions of names are usually coded for 

z?pTJeTvS ing 'Sf* A -de sKIh £ 

by S„*nd ftTSl " P mai,i "2 bunt], “. !,a,k,icaI analysis 

chafers fadesc^diivlA " i,b “ n “ mber ”f alphanumeric 

tenanre procedures usclby TIME h!r rtanCe w e C ° ding a " d lbl main ' 

setting up such a svTt™ Iv . ■ be fou " d Reference 14. In 

to be selective are usually made Fn **** yS ? °f ,be “'egories most likely 
sorting or searching, the second letteToT'rhe T T . eC ° ri ‘ nS " ames for la,er 
little selective value, and is therefore f ^ “l " ame usuall >' lias very 
identifying block code. References 1 in . deve,0 Plng an 

hierarchies in some detail. ’ ’ nd ^ discuss the problem of 


dangers «n coding and error detects and correction 

Often such er^Sfe “ m ° T ™ y ^ deduced. 

Should 6 Sa ' esman ' s commission is cS a ” a f a >™» applica- 

Should an appropriate D rede be ** B • ** C - l«i D, 5*. 

V y Tv 00 '’''"' P a fo h 'ok would be as an A, a substan- 

hrng that pay pe ri od|). Various m«h^t t* a" Salesman has sold any- 
fetefore used In mM Lumerda, such errors are 

pose P Fo S A coni Puted check digit “ comn 'unication and 

PO^. For other devices, see Referrere f typiCal device fo r <his pur- 


t — - 

1 " ,forml y Present for all iLns® nm " ber ° f "W 
• a more serious probl 
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of code development and application exists. See References 1, 2, 12, and 
13 for a discussion of this problem. In such cases, it is common to use hey 
words, phrases, concepts, or subjects derived from the text of a document 
or from a standard list of possibilities to build an indexing code for each 
item. The same procedure is used for item requests. In this more general 
situation uniqueness of description is not assured, nor can it be assured 
that the request for items specified by a given set of descriptors will be 
found in the file. The references cited give various approaches to this as 
yet unsolved problem. Reference 13 illustrates the fact that such problems 
are not only those of librarians, but also of commercial users of large data- 
banks of intelligence information. 


THE USE OF CODES IN INTEGRATED DATA.PROCESSING SYSTEMS 

The basic principle of integrated data-processing systems is that once in 
formation is entered into a system, maximum use should be made of it 
without further data entry. This means that input data and files must be 
structured to exploit the common features of both. , 

For example, the most common form of commercial integrate a a 
processing starts with an order for a product as an initiating acumen . 
This original input then initiates a chain of events-credit checks, inventory 
checks, production ordeis, purchase orders, cost accounting, production 
planning, invoicing, accounts receivable, shipping, and other operations 
terminating the orders execution. Since many of these activities involve 
files, computations, and decisions that can be keyed to the item and the 
customer description, a code is often developed to organ.ee files and com- 
putations in these basic categories. 

The form of this organization is best illustrated by an example. 

Example: Old John Distillers. Ltd., is a major coiporatior .which .acts 

as a holding company for numerous smaller corporations engaged rnfte 

distillation, blending, bottling, anddistr.hutionofspniUn Old John 

controls " basm 

stsXn"' -ah one?d m“ blcs to galions in various increments, 

SSbSriSSXS similar variations,, and distribuies in up ,0 1000 

subclassificationsc combination, the code 

° TJJX c . la ^ been proposed. The product block code 

f h Z in thefi^t toe is the key to the system of later data proecssmg. 


Package Code 


Special Sales 
Pack Data 


fol 1 1 °) ’ l ° l ’ 


[0 


LLi 


lateral CoasI 0 i 1 I 0 1 ' 1 0 I ' 

l o l r 1 o j 1 1 


part 

Function 

Serial 

No 

| Part Ust 
l Bottle 

Package 0101-07012 

Of-OtOt-CTOt 

1 Pack Ust 

Can 

02-0199-0701 

j 0101-07D12 

| Front Label 

00-0101-0701 

0102-07D01 

1 B3ckL2bel 

04-0199-0701 

0109-07D12 

1 Shoulder Label 0^0199-9902 1 

0107-07D12 

Foot Label 

Canon 

OrCS 01-9301 1 
07-0101-0701 



Package Specifications 


Pan Cross Reference (For all parts showing 
a 9 anywhere in the part code) 


C-1 A code for Integrated data processing In a given application, (a) product and 
materia' codes, which are structured for Integrated data processing, (b) A typical 
cross reference available from the structured codes. The example, discussed In the 
fext, is a package and material code application for a distillery. The package code 
uniquely Identifies each product sold, and provides lor statistical analysis by financial 
account, spirit type, brand, size, market, area sold, etc. The material code, related to 
this basic structure permits comparison o! package designs, analysis for common 
materials, etc. The basic code structure can also be used In other forms of record 
keeping and computation In cost accounting, production planning, and marketing. 


THs 12- position code is defined as follows; 


Position 

2j> 

3.4 


55,0 

10.11.13 


JW 


Meaning 

Corporation identifier (45 categories) 

Brand identifier (60 brands) 

Position 3. spirit type (g types) 

Position 4, serial number 
Bottle size (26 sizes) 

Domestic and export code (up to 26 letters) 

Special package identifier (up to 100 as needed) 
Customer code 

Position 10, major area within domestic and 
export class 

Position II, subclass within major area 
Position 12 , customer serial within area subclass 
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Using these definitions, a particular order from a given customer 
would have the same code identifiers in positions 7, 10, 11, and 12. An 
order from a customer might then read as follows: 


Customer D376 
(common to all 
items ordered 
at right) 


Number of cases 
120 
60 
72 


Product type 
010107D12 
022106D06 
117208D31 


This order could be entered into the system by means of a stan- 
dardized form, or sent by teletype from the customer to the factory for 
processing, thus avoiding a second typing operation. 

Once the coded information is received, the customer and product 
codes provide keys to files that are needed for later operation: 


Code key 
Customer Code 
(keyed to 
D376) 


Product Code 
(keyed to 
product codes 
ordered) 


Files 

Credit 

Accounts Receivable 
Statistical Analysis 
Name and Address 
Shipping instructions 
Inventory Status 
Production Specifications 
Statistical Analysis 
Cost accounting 
Production Planning 
Materials Needed per unit 


Cross references between the two sets of files are possible using the 
combined customer-product code. 

Moreover, if product materials, work centers, and employees are further 
identified by a code, further cross-references can be built up. One example 
is illustrated by the second line of Figure C-la and Figure C-lb. In this 
example, the materials used in the bottling operation have been coded 
by function, corporation, brand, size, and serial number. These materials 
can be associated with a given product by reference to the corresponding 
blocks of the code, as shown in Figure C-la, and a cross reference showing 
product-materials or material-packages can be prepared. In the material 
code, a “9” is used to indicate common usage on several product types. The 
material code allows financial analysis by material type, corporation, brand, 
and size, and the speciEcations, shown in coded form, permit the computa- 
tion of materials required for a given production run. 

As an illustration of a side use of such files, it will be seen that a listing 
of material types in numerical order (from left to right) will place together 
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INFORMATION MEASURES 


A source of information is characterized by the variety of its output. If a 
source can emit N distinct characters, then the variety that can be generated 
is N. It is convenient to measure variety in logarithmic form, to the base 2. 
When variety is so measured, the number resulting represents the number 
of yes-no, or binary questions, that would be necessary' to determine which 
one of the A T characters had been emitted, if the A' characters were emitted 
at random. Thus, variety in binary digits, or bits = log. N. Remembering 
that log, .V = 3.322 log,, X, since log. X = (log, X)/( Jo& b), we may com- 
pote the number o! bits, using logarithms to the base 10 as usually tabu- 
lated- 


For example the variety of the sexes is 1 bit, since log, 2 = 1. The 
™ 'S"* 1 “f 45 U . 5 - 7 bit5 ’ since log, 52 = 3.322 log,, 52 = 

cIo S arilfl ™‘c measure has advantage in converting 
Wle ml il^T sira P ,e Thus, if a male or a 

made of the form t *jT nun, ber of distinctions that can be 

*e t ITL T 7 ° f « 2 >152 =101, or more simply, 

ihe vanetj ran be measured as 1 + 5.7 = 6T bi(s 

posrihle "I K 7"'',' 3 sWn 8 °f characters it is 

ttrtng. If these frequencies are stal? ’.T?* cl,arac,cr ’ s appearance in the 
by « gis-en »mp4, “? Hc " 1lh t,mc (stationary) when collected 

? " pTK "'' ™>- compute the probability of each 
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dwacter’s appearand from these data and call that set of N probabilities 
to # 3 ’ * ’ ’ * Fr0m the character probabilities it is now possible 
in te G f a m ? asUrc ^* e actua l information transmitted by the source 
I 0 _ var J e ty and compare this measure to the potential variety, 

& . hi s probabilistic measure is called the entropy of the source. 
t° T example, a source emits four characters, A, B, C, and D. Its char- 
er vanety is log, 4 = 2. The entropy of the source H is defined as 

~ Pi log p, — p 2 log p t — p 3 log Pi - p 4 log p, 

^ our ' c ^ aracter source). The negative signs are used in the definition 

o am a positive result, since the values of p, will be a number less than 
unity. 

The entropy measure takes on its maximum value when all characters 
m emitted with equal probability, i.e when p, ~ 1/N. Thus, for the four 
®racter case, this maximum is 

= (4) ( —i) (log 2 i ) — log 2 4 — log 2 1=2 hits, or log, N. 

If the characters are emitted with another probability distribution, the 
W t f ,e source will be less. For example if p{A) = 0.4, p(B) = 0.3, 

/ — 0.2, and p(D) = 0.1, the entropy of the source is 

H ~ - 0-4 log, 0.4 - 0.3 log, 0.3 - 0.2 log, 0.2 - 0.1 log, 0.1 
or 


: 3.322 ( - 0.4 logio 0.4 - 0.3 log, 0 0.3 - 0,2 log* 0.2 - 0.1 log 0.1 ) 


llc ^ u pon evaluation, using a table of common logarithms, is 1.846 bits, 
ote that log lo 0.4 = log 10 & = log to 4 — logio 10. We thus look up log,,, 4 
the common logarithm table and get 0.6021. Since log 10 10 = 1.0000, 
ogio 0.4 = 0.6021 - 1.0000 = -0.3979. SubsUtution of these values in the 
3 , °^ e expression for H gives the required result, which is positive because 
0 t e negative signs in the definition. Thus, 


H * 3.322 ( 0.4 X 0.3979 + 0.3 X 0.5229 + 0.2 X 0.6990 + 

0.1 X 1.0000) = 1.846 

fact that a source emits with a probability distribution that is not 
equally likely for all characters is another way of saying that there is some 
C ° n ^ a * nt ln ^ an S ua S e used. 

The efficiency of the language in transmitting information, as measured 
y the entropy of the source, is 


Efficiency = H/H m „ — ///log, N. 

F ° r Ibe four-character source just discussed, the efficiency of messages 
generated by characters emitted at random according to the specified 
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come and go, the consistency, completeness, and relevance of the codes 
used in current application suffer greatly. 

Since a major commitment in system design is made at coding time- 
one that affects system cost and usefulness for generations of later decisions 
-the expense required to obtain complete documentation, documentation 
updating, and long range documentation planning is easily justified. 

In short, although the task is expensive, often plodding and tedious, and 
frequently a psychological burden for the systems analyst with expansive 
plans for the future, the importance of adequate documentation of a cod- 
ing system and its use cannot be overstressed. Although the aphorism “A 
job worth doing is worth doing well* is wom out by now, it applies with 
certainty to the job of code development and maintenance. 
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characters appearance from these data and call that set of N probabilities 
Pi(i = 1, 2, 3, , AT). From the character probabilities it is now possible 
to compute a measure of the actual information transmitted by the source 
to terms of variety and compare this measure to the potential variety, 
log* N. This probabilistic measure is called the entropy of the source. 

For example, a source emits four characters, A, B, C, and D. Its char- 
acter variety is log? 4 = 2. The entropy of the source H is defined as 

~ Pi JogPi - P 2 logp 2 - p 3 1 ogp 3 ~ p t log p 4 

(for the four-character source). The negative signs are used in the definition 
to obtain a positive result, since the values of p t will be a number less than 
unity. 

The entropy measure takes on its maximum value when all characters 
are emitted with equal probability , i.e., when p, = 1/A 7 . Thus, for the four 
character case, this maximum is 

= (4) ( —X ) (log, i ) = Iog 2 4 — log* 1 = 2 bits, or logs N. 

If the characters are emitted with another probability distribution, the 
entropy of the source will be less. For example if p (A) — 0.4, p(B) = 0.3, 
P(C) ~ 0.2, and p(D) = 0.1, the entropy of the source is 
H = ~ 0.4 log, 0.4 - 0/1 log, 0.3 ~ 0.2 log, 0.2 - 0.1 log, 0.1 
Or 

H - 3.322 ( ~ 0.4 logjo 0.4 - 0B Iog JO 0.3 - 0.2 Jog l0 0.2 - 0.1 log , 0 0.1) 

which upon evaluation, using a table of common logarithms, is 1.846 bits. 
Note that logic 0.4 = Iog 10 = Iog, 0 4 - Iog t0 10. We thus look up log,„ 4 
in the common logarithm table and get 0.6021. Since log,<, 10 = 1.0000, 
logjo 0.4 — 0.6021 - 1.0000 = - 0.3979 . Substitution of these values in the 
above expression for H gives the required result, which is positive because 
of the negative signs in the definition. Thus, 

H ~ 3.322 ( 0.4 X 0.3979 + 0.3 X 0.5229 + 0.2 X 0.6990 + 

0.1 X 1.0000) = 1.S46 

The fact that a source emits with a probability distribution that is not 
equally likely for all characters is another way of saying that there is some 
constraint in the language used. 

The efficiency of the language in transmitting information, as measured 
by the entropy of the source, is 

Efficiency = H/» m „ ~ ///log, N. 

For the four-character source Just discussed, the efficiency of messages 
generated by characters emitted at random according to the specified 
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K source o£ information is characterized by the variety of its output. If a 
source can emit N distinct characters, then the variety that can be generated 
is N. It is convenient to measure variety in logarithmic form, to the base 2. 
\Vhen variety is so measured, the number resulting represents the number 
of jes-no, or binary questions, that would be necessary to determine which 
one of the S characters had been emitted, if the N characters were emitted 
at random. Thus, variety in binary digits, or bits = log* N. Remembering 
that log, N = 3.322 log,, N, since log* N = (log, N)/(log, b), we may com- 
pute the number of bits, using logarithms to the base 10, as usually tabu- 
lated. 

For example, the variety of the sexes is 1 hit, since log; 2=1. The 
variety of 52 playing cards is 5.7 bits, since logi 52 = 3.322 logt 0 52 = 
3.22 y 1.760 = 5.7. The logarithmic measure has advantage in converting 
rmiltiphcaUse combinations into simple additions. Jhus, if a male or a 
female can select one of 52 cards, the number of distinctions that can be 
made of the form "male: Aec of spades" is 2 X 52 = 101, or more simply, 
the variety can be measured as 1 + 5.7 *= 6.7 bits. 

If the source S with a character set N emits a string of characters it is 
possible to determine the frequency of each character’s appearance in the 
string. If these frequencies are stable with time (stationary) when collected 
by a gi\en sampling process, we may compute the probability of each 
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character’s appearance from these data and call that set of N probabilities 
Pi(* = 1, 2, 3, , N). From the character probabilities it is now possible 
to compute a measure of the actual information transmitted by the source 
in terms of variety and compare this measure to the potential variety, 
log, N. This probabilistic measure is called the entropy of the source. 

For example, a source emits four characters. A, B, C, and D. Its char- 
acter variety is log, 4 = 2. The entropy of the source H is defined as 

— p r log Pi — pi log Pi — Pi log pi — p4 log p« 

(for the four-character source). The negative signs are used in the definition 
to obtain a positive result, since the values of p, will be a number less than 
unity. 

The entropy measure takes on its maximum value when all characters 
are emitted with equal probability, i.e., when p, = l/N. Thus, for the four 
character case, this maximum is 

hfjn** = (4) ( —~i ) (Iogj i ) = lo g 2 4 log: 1 = 2 bits, or log_,N. 

If the characters are emitted with another probability distribution^ the 
entropy of the source will be less. For example iS p( A) — 0.4, p(B) — ■ , 
P(C) = 0.2, and p(D) = 0.1, the entropy of the source is 

H = - 0.4 logi 0.4 - 0.3 log, 0.3 - 0.2 log, 02 - 0.1 log, 0.1 


H = 3.322 ( - 0.4 log,, 0.4 - 0.3 log,, 0.3 - 0.2 log,. 02 - 0.1 log „ 0.1 ) 

which upon evaluation, using a table of common logarithms, is I.S46 bits. 
Note that log,. 0.4 = log,. A = log,, 4 - log,. 10 We thus look up ogu. 4 
in the common logarithm table and get 0.6021. Since log,. ; ■ 

log,, 0.4 = 0.6021 - 1.0000 = -0.3079. Substitution of these values m the 
above expression for If gives the required result, which is positive because 
of the negative signs in the definition. Thus, 

H = 3 322 (0 4 X 0.3979 + 0.3 X 0.5229 + 0.2 X 0.6990 + 

0.1 X 1.0000) = 1.846 

The fact that a source emits with a probabili* distributi on that 1 Is not 
equally likely for all characters is another way of saying that there is 
constraint in the language used. 

The efficiency of the language in transmitting information, 
by the entropy of the source, is 

Efficient;)' = IUH„, = 7//log, N. 

mr-character source just discussed, the efficiency of mcs5a |“ 
characters emitted at random according to the specified 
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character s appearance from these data and call that set of N probabilities 
P%(i = 1, 2, 3, . . . , 2V). From the character probabilities it is now possible 
to compute a measure of the actual information transmitted by the source 
m terms of variety and compare this measure to the potential variety, 
log z N. This probabilistic measure is called the entropy of the source. 

For example, a source emits four characters, A, B, C, and D. Its char- 
acter variety is Iog 2 4 = 2. The entropy of the source H is defined as 

" Pt log Pi - p 2 log p 2 - p 3 log p 3 - p t log Pi 

(for the four-character source). The negative signs are used in the definition 
to obtain a positive result, since the values of will be a number less than 
unity. 

The entropy measure takes on its maximum value when all characters 
are emitted with equal probability, i.e., when p, = l/N. Thus, for the four 
character case, this maximum is 

tfnux = (4) ( -i) (log 2 i) = log* 4 - log 2 1=2 bits, or log 2 N. 

If the characters are emitted with another probability distribution, the 
entropy of the source will be less. For example ii p( A) = 0.4, p(B) = 0.3, 
P(C) = 0.2, and p(D) = 0.1, the entropy of the source is 

H = - 0.4 Iog 2 0.4 - 0.3 log, 0.3 - 0.2 log, 0.2 - 0.1 log, 0.1 
or 


H - 3.322 ( - 0.4 Iog 10 0.4 - 0.3 log 10 0.3 - 0.2 log 10 0.2 - 0.1 log ,* 0.1) 

which upon evaluation, using a table of common logarithms, is 1.846 bits. 
Note that Jog 10 0.4 = log 10 A = Iog,» 4 - log l0 10. We thus look up log 10 4 
hi the common logarithm table and get 0.6021. Since log 10 10 — 1.0000, 
logio 0.4 = 0.6021 - 1.0000 = -0.3979. Substitution of these values in the 
above expression for H gives the required result, which is positive because 
°f the negative signs in the definition. Thus, 

H = 3.322 ( 0.4 X 0.3979 + 0.3 X 0.5229 + 0.2 X 0.6990 + 

0.1 X 1.0000) = 1-846 


The fact that a source emits with a probability distribution that is not 
dually likely for all characters is another way of saying that there is some 
constraint in the language used. , 

The efficiency of the language in transmitting information, as measureo 
hy the entropy of the source, is 

Efficiency = H/H m „ = H/ log, N. 


For the four-character source just discussed, the efficiency 
generated by characters emitted at random according to 


of messages 
the specified 
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praU Wtty distribution is I.S46/2.000 = 0.923. The language redundancy 
is the complement of efficiency 

Redundancy — 1 Efficiency 

Thus, the language generated by the four-character source above has a 
redundancy of 1 — 0323 or 0,077. 

A word about the mterpretatiou of entropy as used here: The entropy 
of the source is a statistical aserage of svhat may be cal ed the cbaracte 
entropy, A„ Thus, fa the abose problem, the character A may be satd to 
Sse a chiacte/entropy of - (3-322) log,. 0.4 = (3322) (03979) = 133 
bits. Similar evaluations may be made for B, C, and D. However, the source 
emits these characters with given probabilities, so the entropy of the 
source is the weighted sum, or average value 
x x 

H * — £ pA. s -Ep. h>g.p, 


The source is thus said to emit an average of 11 hits per symbol. Notice that 
character entropy increases with the rareness of the character in the emis- 
sion, as indicated by p t . The entropy of the source, which refers to the set 
of symbols generated, provides a measure of the disorder of the emission 
or the unpredictability of the characters emitted, and is thus maximum, as 
we have seen, when the emitted symbols are equally likely. Another way 
of interpreting this result is to say that the source transmits the greatest 
information per character when the characters themselves are equally 
likely to occur- This result has great importance in increasing the efficiency 
of an information source, because by appropriate coding procedures, the 
emission from a source may be made to approach this idea) “maximum 
information’ condition. 


So far w e have discussed a source that emits “monograms" or single 
uncorrelated characters. We could also have considered a source that emits 
digrams, or pairs of correlated characters. For example, due to the struc- 
ture of 3 language, the sequence of letters in a word is not random, but 
coEotramed, i4i Encash a “q" is always followed by a '‘u" and a ‘f is likely 
to be followed by an “h." Markov processes, in which the probability of 
transition from one state to another is the conditional probability' p(f.i), 
arc another example of twixharacter correlation. 

In a way exactly analogous to the monogram calculation, it is possible 
to compute digram entropies— and indeed to compute entropies for 
longer strings of correlated characters, common in many human languages. 
The reader is directed to the references for these computations. However, 
wc note that the addition of statistical correlation between parts of a 
mewage always tends t o reduce the information value per character, and 
Uwrteiore the entropy of the source. {To illustrate, newspaper English, taken 
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on a monogram basis provides about 4.14 bits per character. The inter- 
pretation of this number is that instead of a 26 character alphabet, another 
character set containing 18 symbols, giving log 2 18 = 4.14, would do the 
same job if these 18 characters were used with equal probability. However, 
when diagram correlation in the English language is taken into account, 
the computed information value per character drops to 3.56 bits, equivalent 
to a new set of only 12 equally probable symbols. Although the computa- 
tions rapidly get out of hand for very long strings, Shannon has concluded 
(Reference 3) that “The long-range statistical effects (up to 100 characters) 
reduce the entropy [of common English] to something of the order of one 
bit per letter, with a corresponding redundancy of roughly 75 per cent.’* 
Thus, in conclusion, for a source to be most efficient— for it to transmit 
the largest amount of information per character— the sequence of characters 
must be uncorrelated and the symbols themselves must be equally probable 
in the string. Coding procedures to achieve these conditions are often used 
to improve the efficiency of an information source, as defined here. Such 
procedures also have an important bearing on the secrecy of transmission, 
because messages coded for maximum efficiency do not provide code 
breaking guides that can be gained from a knowledge of language structure. 
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SUGGESTIONS FOR A STUDY 
OF YOUR HOMETOWN 1 


(Editoh's Note): Because of the importance in applied anthropology of early 
field training, preferably in the second > ear of concentration, it is obviously 
necessary for this land of training to take place in and around the University. 
Consequently, ue believe that our readers who are concerned with teaching 
programs will be interested in this memorandum which Dr. Lamb gave to his 
students at M.IT Needless to say, the memorandum is primarily concerned with 
urban studies and should be considered not as a complete description of field 
technique, but rather as points of emphasis with which the student might not 
oUierwise be familiar) . 

This memorandum is written as if you were visiting Hometown for the first 
tune and as if jour company or organization had instructed you to arrive as 
quickly as possible at a comprehensive knowledge of Hometown so that you 
might effectively represent it tlicre. Towards the end of the memorandum I 
sliall have something to say about the advantages you, with your long experience 
in die community, would have over a newcomer in your otm Hometown. 

To do tins |ob of community-analysis there are certain tools you will obviously 
need A map of Hometown is your first tool, for a bnef glance at it will provide 
die trained eye with more facts than could be secured from any odier source. 
(This of course depends upon tl»e map; most street maps are featureless without 
an accompanying street directory ) . 

1 liy Robert JI Lamb (deceawd) Reprinted from Human Or&inizatum, Summer, 1952, 
by p<TmikNicn c k tb Scutty for Applied Anthropology. 
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- “ 1 ^’ AmeT V n — ““ -» even 

*h iriVKUgZn as Z rt ' .7 "* “ “Usable pa[t my 

directory: (1) the alnhahrtf T ^ t ^ C P f nnCipal divisions of the average 
organisations etc? /Iwt f ",T. “ Cti0 " f ° r Mvid “ aIs “ d ■“>*>■» 
usually each senaiate fo 1 dlrec, °' y ^g <*ch house or bufiding, and 
classified advert? 1 , y or busme “ occupant of such buildings: 13) ih e 

duc. s ot d se a ctr“ g „r c r- * not 8)1 -* «•£ « L£ 

arranged l conta “’ ia B about the town statistically 

>» Se clfy Ztm meT “ °'? er S0UrC “ : ‘ here “ ^ ab ° * sechon devoted 

of deta.1 I ou t nSnneTf h PrmCiP ° h °' derS ' 0,111 ° f ' en a ^ d “' 

F tin r n personneI of the various city services. 

of tbadhT 0rientati “" i[ ff y ““ can find a series of older maps 

ckaracten^Sci^th' ^ ^ " ‘ ha ' y °“ “ 

‘0 leM°m„?? U 7 aV ?I° n<i T ch furtber ™ lh your investigation it will be advisable 
t°oh one n ° Ut earIier mIiabi tants. For this purpose you will need another 
county-widp ^ V s O U T e i ° f , ! 0Cal hkt0ry ’ usudI y 10 be f ound in the form of 
devoted tn n, . Cmmts °\ * e ^tory and biographies of the area, with sections 
Ot} t0wns ^d cities in each county, 

and see wS be susgested lato - •>“* »= shaU start with those mentioned 

social structure * WCek m tOWn W ® couU amve at “V understanding of its 

to tynjcaf S ^ t count y history. Here we shall assign fictitious names 

the nan* C f ^ oonor or blter ^ die county history we will come across 

perhan« e ° in.** 1 Ear * y wbo was connected with the Early Trust Company; 

- - P , W f sba d a “ 0 b e able to establish that William A. Newcomer married c — 


rt._*^ e girls. If we can also connect the Early and Newcomer families with 


(jj e r j . e aiso connect tiie isariy and Newcomer . 

nine °» In 3tl0n Hometown Manufacturing Company we have 


a good run- 


*«■{« ° n a rec °ns traction of the way people have earned their living in Home- 
matt °^ any ^ ears ‘ "^us is, of course, a roundabout way of approaching the 


have j proDaDiy in tiie county history many names ot families who 

^ine T? Ve ° awa y» and others whose names have died out, at least in the direct 
ough middle and even first names may survive, 
in H 6 ^ U * c ^ cest wa y to find out where the major economic decisions are made 
McN 1M OWn ** to to tbe public library and ask to see a copy of Rand 
b ankA & an ^ ers ’ Register (if the hbrary does not have it, go to one of the local 

°f n hr , ere ^ ou "ufi find the names of all the Hometown banks as of the year 

£* e at j Cat *° n ' bovvev er, because of the mortality of banks after 1929 there are 
jQ 2 g ^” anta 8 e s in looking up two volumes: the current one and one prior to 
Her a * e this list of banks and look up another publication, Moody’s Banks. 
yfiQ* ^_ 0U 'yifi find the names of the directors of your Hometown banks, together 
tnb f a °f die mergers which have taken place and which have con- 

- u ed to the present condition of these banks. In this book there will also be 


found 


a current bank statement of assets and liabilities, and probably the dividend 


c J^^dier set of tools becomes necessary: a card file. Some people prefer 3 x 5 
03 be c a use they are handy to cany; others use 5" x 8" because they provide 
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more space for notations. Use one card for each bank and record the names of the 
directors on the ruled side and the history of the bank on the reverse, unruled 
side. Then make a separate card for each bank director, with liis name (last name 
first) in the upper left-hand comer. Put his address if it is available on the top 
right, and his directorships (with his principal connection on top) in the 
middle. On the reverse side of the card record his personal history, date of birth 
and parentage at the top, marriage, children, education, etc. These cards should 
be alphabetically filed in the boxes in which the cards were bought, thus dis- 
pensing with the added expense of buying special files. 

Card files become your most important source of information, and you will 
find that it is necessary to cross-reference them constantly. They constitute the 
basic difference between our approach and that of the census-taker: We are 
primarily interested in individuals and their patterns of relationship, while he 
is interested in the overall statistical aggregates. 

With the list of bank directors in hand, turn now to another tool volume: Poors 
Register of Directors. Here are listed all the most important corporate directors in 
the United States. These men do not always record their directorships, but the 
listing is as complete as the editors are able to make it on the basis of their 
investigations. From Poor's Register you can find the other corporate directorates 
held by Hometown bank directors. This will lead you at once to the names of 
all important Hometown businesses, for banks tend to accumulate to themselves 
the leading financial and business talent in town. This method of analysis will be 
successful provided Hometown’s bank is not part of a chain of banks which 
merely maintains a manager in your town. But unless your town is very small 
it did once have a bank, and not so very long ago, so that a study of earlier 
editions of the volumes cited above will reveal the names of Hometown's bank 
directors. 

Such volumes as these may not be available in many cities and towns but if 
there is a university in the neighborhood its library may carry them; they should 
also be found in large city public libraries. In many cities there are often business 
libraries attached to the Chamber of Commerce, or elsewhere, and usually your 
Hometown banks subscribe to some or all of these publications. If, among its 
successful sons who have moved away, your town numbers a broker in some big 
city or a metropolitan banker, a letter to him may bring your library a copy of 
one or more of these volumes which although perhaps a few years out of date 
will still be useful for your general purposes. The names of the directors of your 
banks for the current year can be ascertained by asking the local bank for a copy 
of its annual statement, which is usually published in a small folder on the back 
cover of which is a list of the directors. Business connections which are not 
corporate, such as partnerships or firms, will usually be found in your street 
directory. 

Turn now to the Hometown Manufacturing Company in your directory where 
the names of its officers and chief supervisory employees will probably appear. 
This will enable you to plot on the map the residences of superintendents or 
foremen, and operating management. You will find that while only the top men 
live on High Street or in “Hills and Dales " the others often tend to live as close 
to these neighborhoods as they can afford, the older men nearer High Street, the 
younger nearer “Hills and Dales.” 
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who wn 1, ■ f f f ^r^ctories also designate the place of employment of those 
“u r m the Hometown Manufacturing Company by some such mark as 
resirf r t ^° r exain f^ e - This will enable you to plot on the map the 
worl- enCeS 1 . 6 em P !o > retl at the factory. When the craft or special skill of the 

, JS indicated, as it is in most directories, you can End out where the 
as Cr . grades of workers live. (It should be understood that the word “grades," 
use ere, has a purely technical usage, such as “machinist" or “foremen," and 
. a aie ‘^ ls Q f distinguishing the probable wage received, since this tends to deter- 
une what rent workers can pay, and other economic facts we are trying to 

establish). 


nee you have begun to accumulate cards for individual residents m different 
P of the city, and have marked their cards according to their occupations, 
ouareon the way to the preparation of a residential map of the city. It will be 
a viable to devise a system of symbols to designate the different income grades 
111 occupational groups, and attach them to individual residents on the map in 
c er to show the approximate location of their homes on the streets of the city. 

tiddentally, if you are enterprising, one map bought from the local book- 
se er who handles the Hometown street directories will enable you to trace any 
number of copies on transparent paper. Once you have plotted on your maps the 
ocation of your bank directors, corporation directors, superintendents, foremen, 

, workers in the factory, you are ready to branch out into a n eigh b orhood-by- 
neighborhood investigation of Hometown ; this wj U include its churches, neighbor- 
hood and nationality clubs, formal and informal social groups, political and bust- 
ness groups, etc. 

Such an investigation will lead you back in the direction of your census 
materials, which are also to be found in the public library. But the model for 
the investigation is not to he found in the census, but in the publications of social 
Workers. For information about census tracts it will be necessary to approach 
someone connected uith a public or private charity, or a governmental office 
dealing with welfare or relief. In many cities (and their number is increasing). 
Students have made area studies of the different neighborhoods, showing the 
number of cases of one sort or another falling in a given area, types of sickness, 
kw violations, social maladjustments such as juvenile dehquency, have been 
Indicated area by area on city-wide maps. This is one of the most fruitful parts of 
any study you may make of Hometown. However, these findings must be con- 
sidered very carefully and you should avoid arriving at a hasty conclusion on 
the basis of any one type of case as it appears on the map- 

From this wealth of material, a comprehensive pattern of neighborhood dis- 
tribution of these various groups will emerge. By turning to the street section of 
your Hometown directory you may be able to discover whether or not the resident 
is a home-owner and sometimes whether he has a telephone. Tins information 
will help to confirm your guesses about income status. By working back and Forth 
between the street section and the alphabetical section of your volume, you wall be 
able to verify your sampling on a street by street basis. In this way you will 
soon know a great deal about the distribution of Hometown population— group 
by group- and income-class by class. If your company were interested, for 
example, in a door-to-door selling campaign, this information would be of con- 
siderable help. 
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For some purposes it may be necessary for your company to know the national 
origins of different groups in the community. In past times many of these groups 
tended to live in their own separate neighborhoods and to develop separate 
national institutions, such as churches, parochial schools, clubs and organizations 
of one kind and another. Increasingly these distinctions arising from national 
origin are breaking down in this country. The younger generation is tending to 
move out to newer suburban developments where they mingle with people of 
other national origins. This is part of the "melting pot” of which the Americans 
are so properly proud. A study of your Hometown as prepared by the method 
described above will reveal, however, that the "melting” process is far from 
complete in many communities. 

Any study of local politics will reveal that politicians are highly aware of 
group differences of all kinds, including those just described, and that they are 
a factor in the political, social and economic life of the community. By turning 
to the first section of your Hometown directory where information about City 
Hall and other city services is listed, a card file can be made on the political 
structure of Hometown. A study of this file will show that it reflects some of the 
divisions within the community, as already indicated in the foregoing, 

The life of most communities is still dominated to a large extent by the 
oldest inhabitants and their descendants. Usually they exercise the chief influence 
on the boards of local banks, and insofar as businesses are locally owned they 
tend to retain a controlling interest. But today this is by no means a universal 
pattern since the influence of out-of-town corporations has become more and 
more important in all but the leading cities of the country. As a first approxima- 
tion we can say that the social system of a community (with its various organiza- 
tions, such as the Community Chest, Red Cross, etc.) leans heavily upon the 
families of those long established in the community; the economic leadership is 
drawn from the ranks of newer individuals and groups; and the political leader- 
ship is even more frequently in the hands of the representatives of newer groups. 

To understand the social system of the community insofar as it can be dis- 
tinguished from the political and economic systems, it is usually necessary to 
begin with the churches as the oldest, local, social institutions. These are in turn 
distributed throughout the community, and a map showing their location will 
be very instructive and assist in pointing up some of your other findings. Since 
most churches are not only religious but neighborhood social organizations, 
incorporating many nonrefigious activities, a study of the leadership of these sul> 
groups within the church will also contribute to your understanding of the 
Community. 

Any young businessman coming to the city for the first time and expecting 
to establish residence there will want to know about the other social and charitable 
Organizations, their functions and their representation of various groups. Some 
of these can be called “total community" organizations, for instance the Com- 
munity Chest, or, within the business community, the Chamber of Commerce. 
Others are representativ e of special groups in the community. One of the most 
significant keys to the social grading system is to be found in the structure of 
the more exclusive dubs. There is usually a club to which only the older inhabi- 
tants are admitted, and their method of choosing even among tills older group 
displays their attitude as to the necessary qualifications for membership in the 
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inner circle of their group. If you can secure a list of their membership and 
compare it with other parts of the social structure, such as leadership in total 
community organizations, churches, charities, and clubs, you will have a use 
key to the relationship between the older and newer groups in town. _ 

In this type of study it is easy to lose sight of the fact that getting a living « 
the backbone of community life and that the jobs held by men and women are 
bound to be the ruling factors in their lives. The increase of absentee ownership 
of factories and stores, and even of newspapers and banks m towns and ernes of 
the United States, makes it more and more difficult to understand flu .pattern 
of organization of individual communities. To find out what is appen , 

plants and businesses you need new tool volumes. For industries, consu 
(or Poors) Irulustriob. One or the other of these will give you a pic ur 
extent to which your local factory is still locally controlled, or t° ^at extent 
control has passed to out-of-town groups. While these sources ™ 
our present purposes, if a really extensive study of these ma ers ' V p , ' 

it would be nerLary to consult the records of the Seconder and 
mission in Washington. If the language used is unfarn ar, s 
who has experience with reading such source mateml should be ca P f 
help. Hereyou may find that the local company (though sb ,1 1 ' * ^ “J 

perhaps undergone a series of mergers prior to arriving a P 
are interested in the historical aspects of ,l,e ccmmun.ty it wdl worm 
studying the companies that merged to fonn the exis g • vate j y . owne d 
local industries is paralleled by what has happened to you, JX 

electric light, gas, water, and street transportation system or ^system . A ^ 
tool volume is available for investigation of these complies “ 1,1 J d J 
Public Utilities. All of these facts should be recorded on file cards ana p p r 

catalogued. Gathered a very comprehensive 

In a short space of time you will no g ity 0 f course 

picture of the life of Hometown. (T le size 0 ^ muc }, WJ jj depend upon 
determine the length of time this job wi , nture to say that you will 

your previous experience in making such s )_ escaped >our notice 

know many things about V- 

even if you had lived there all your • most important facts wluch 

cannot find out by these mechanics rn • u js essen tial to live long 

tend to elude this approach are o P f th st important of 

years in a community in order to be had 3 sl)0 rt residence in 

these facts. As an outside observer, or e e ^ ^ adequate substitute for 

the community, you cannot hope o , your company, you are expected 

this experience. However, as ' eP ^ n bIe substitute for such long residence 
to find a short-cut wluch will be h P . , ^ to gain access to certain 

The best substitute for your own ^ l}iere are certam people in 

of the oldest inhabitants. Experience socially elect group but frequently 

the community, not always the patterns »» !»«* 

drawn from among them, whose typ , , methods. Frequently these 

jus. described, without resort to “ pM „io„s in ,1m corn- 

individuals have recently T 

»T,heVre%ec. yon - a scientific in-tigator and believe >ou luxe 
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the best interests of the community at heart, they will often be of invaluable 
assistance to you in providing that type of information which can generally come 
only from a lifetime knowledge of their town or city. You will do well to try 
to End at least one such person (and if possible several) who is willing to assist 
you and who will talk to you freely. Hy working with more than one of these 
individuals you will be able to triangulate your results and so avoid some of the 
inevitable effects of bias. Even the most objective of these persons is bound to 
see the life of his community from a slightly different vantage point than would 
any other such observer. You must learn to make allowances for these differences 
of point of view. 

If you, the reader, are yourself a lifelong resident of Hometown, you will be 
able to supply the same sort of information as this oldest inhabitant, and you 
will want to correct your bias by die methods I liavc described. Nevertheless a 
lifelong resident will have a great advantage over our supposed representative 
of an out-of-town company. You will know, for example, who married whom, 
and what the grandparents and even the great-grandparents of many of >our 
fellow residents contributed to the life of the community. WiUiout having to 
make maps of the historical growth of the community, >ou will know at what 
point in time which suburbs developed, and just when different local businesses 
came under out-of-town control. 

Without referring to the files of the local newspapers (which by the way 
are indispensable to our visitor-observer) or having to talk to the local newspaper 
editors, you will know just which events in the life of Hometown are of the 
greatest importance in the estimation of its citizens. You will know when crises 
arose in the life of the community and how they were solved. You will know 
what effect the depression and mass unemployment had on the town or city 
and what happened when labor tried to organize the local plants and businesses. 
Above all, you will know the personalities and dispositions of the human beings 
who make up Hometown, and you will realize what an important part such 
personal traits can play in the average community. In short, these mechanical 
methods I have recommended are bound to produce a de-humanized picture. It 
is essential for the observer to try to restore the characteristics of a living com- 
munity, with its hopes and fears, its shared pleasures and its sorrows. 

One of the most elusive things you must try to understand is “community 
spirit,” and in this connection you must attempt to discover what individuals or 
groups in the community hold the symbols of community leadership at any 
given time. 

Outside of the churches there are two places to look for the symbols of com- 
munity integration. One of these focal points is the cemetery and the other is the 
patriotic organization. No community, modem or ancient, can be understood 
without reference to these two sets of facts. It has been said that “the most 
important people in Hometown are dead " Even to American society, with its 
gaze fixed upon expansionism and the future, ancestors are of great symbolic 
importance. If ancestors in general are important, those who participated in our 
military history are of great significance. Certain patriotic organizations in each 
community tend to be regarded by the rest of the community as safeguarding 
the symbols of patriotism. It is necessary to study the structure of these patriotic 
organizations as an important factor in the advancement of Hometown life, to 
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be present on the day of their most symbolic activity, Memorial Day, and watch 
the course taken by the parade— from High Street down through the business 
section of the town and out to the oldest cemetery. This will usually contribute 
many useful facts about the nature of community spirit. 

All this is bound to sound hke an overwhelming job; for the largest cities, of 
course, it is much too great an undertaking for any single individual to carry out 
in a reasonable period of time, even as a first approximation. Nevertheless, if you 
will re-read these proposals with care you will find ways and means for short- 
cutting and sampling, depending upon the size of your community; after operating 
with this outline for only a few weeks, the characteristics of your community 
will take on new significance even if you have lived there all your life- 
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Behavior, human, theories of, 413 
Behavioral science man, 416-19 
Behaviorism, 30 

Bellman's Principle of Optimality, 
309-10 

Binary numbers, 116-18 
Binary splits, in trouble-shooting, 264- 
67 

Bits, 117, 119 

per unit time, channel capacity 
measured in, 150 
Black box. 29. 88, 238 
defined, 29-36, 31 
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Black box assumptions, of compati- 
bility of outputs to inputs, 33 
of independence, 33-33 
of stability of transformation, 31-32 
Black box transfer function, defined, 

35 

for feedback loop, 45—16 
Black box transformations, 20, 30-31, 
33-38 

by analogy, similarity, and modifica- 
tion, 30-31 
combining, 33-37 
and observation and experiment, 31 
by specification, 30 
with time delay, 37-38 
Block diagram, 27, 34, 68 
Block substitution, in trouble- sbootin g, 
270-72 

Boolean algebra, using designation 
numbers, 320-30 
Break-even chart, 161, 162 

Canonical table of possibilities, 322- 
23 

Capability, and intelligence questions, 
223-30 

long-range studies of, as higher- 
order simplification, 230 
present and potential, 218-22, 226 
Cargo shipments, constraints limiting, 
225 

Central Intelligence Agency, 249 
Central Limit Theorem, in statistics, 
170 and ft., 389 

Change, cultural, arguments for and 
against analysts concern with, 
423-24 

introduction of, 420-22 
Channel capacity, in bits per unit time, 
150 

Classification, in systems definition, 87, 
89. 91-93, 95-102 
use of, 129-30 

Clustering effect, 163. 167, 170 
in politics. 229 

in omptom-ckuse-complex data, 
33S 

COBOL. 126 n. 383. 505 


Code(s) in commercial use, 494-505 
block, 496-99 

dangers in, and enor detection and 
correction, 500 

development of, from message or 
item contest, 500-01 
documentation of, 504-05 
hierarchies in structure of, 499-500 
in integrated data-processing sys- 
tems, 501-04 
selection of, 494-98 
serial, 496-99 

Combinations, improbable, 189 

logically or statistically restricted, 
in elimination, 162-66 
Combinatorial numbers, 14-17, 22 
Complete logical basis, 323, 334 
Computer, analog, 363 n., 365, 367 
electronic digital, see Electronic 
digital computer 
Computer flow diagram, 34 
Computer programs, standard, as 
simulators, 367 

Computer simulation languages, 389- 
93 

Compute-tally-test sequence, in logical 
operations, 39 

Conflict, in decision making, 203-10 
passim 

multiple goal, 210-11 
Constrained logical basis, 330-33 
Constraint refinement, and goal, 451- 
53 

Continuous functions, notation for, 
487 

Contour map, 176 

Control, and Ashby’s Law, 138, 143, 
148 

Control sj stem hierarchies, and sys- 
tem memories, 111-30 
Conversion chart, six-bit coded deci- 
mal (IBM 1401), 117 
Conversion operations, see Transfor- 
mation (s) 

Comer evaluation. Simplex Method 
of, 163 and n. 

Correction operations, 27, 41-51 
Correlation analysis, statistical, 170 



Index 537 


Cultural gap, leadership and organiza- 
tion to bridge, 424-26 
Curiosity, as reason for testing system, 
238-39 

Cybernetics, 24 n. 

Data, distinguished from patterns, 
122-23 

tabular, rearranged into partitioned 
blocks, 194-95 
Databank, 222-30, 447 
Data collection, 93-98, 180, 185, 186 
classification of, 91-93 
consistent category definitions used 
for, 185 

Data-processing systems, integrated, 
use of codes in, 501—04 
Decimal conversion chart, six-bit 
coded (IBM 1401), 117 
Decision block, 38 

Decision making, conflict in, 203-10 
passim 

Decision table, 41, 43, 331, 332 
Derision tree, 39-40, 76, 77, 78 
See also Tree diagram 
Delay line, 121 

Descriptive element, in long-range 
definition, 221 

Designation numbers, used in Boolean 
algebra, 320-30 

Detection at glance, in trouble-shoot- 
ing, 260-61 

Deterministic machine, 321 
Dewey Decimal System, 498 
Dictionaries, and information search 
procedure, 281—82 
Differential analyzer, first, 363 n. 
Dimensions, and extraction, in simpli- 
fication, 163 

Discrete functions, notation for, 479 
Discrimination and tlireshold methods, 
of elimination, 166-74 
Distribution, of system measures, 167, 

168, 169, 170 and n. 

Division, matrix, 468 
Dynamic programing, as search ap- 
proach, 303-10 
DYNAMO, 390 and n., 391 


Economics, efficiency in, meaning of, 
203 

trade curves in, 177-78 
Effectiveness measure (s), 430 
defined, 202 

for departments of industrial orga- 
nization, 204-10 

distinguished from efficiency, 202 - 
03 

and growth of industrial organiza- 
tion, 207-10 

Efficiency, distinguished from effec- 
tiveness, 202-03 

of language in transmitting informa- 
tion, 507 

Electronic digital computer, 121, 122, 
126 n., 156, 195, 321, 435 
storage of symbols in, 116-18 
system simulation by, 365, 367, 369, 
376 

and wide use of databank, 222-23 
Elegance, concept of, 156 
Element sheet, for organizing data col- 
lection, 92, 93, 94 

Elimination methods of simplification, 
157, 158, 159, 160-74 
problems in, 180-90 passim 
Endogenous event, within system, 99, 

100 

Entity, as system component, 90, 98-99 
Entropy, 149, 150 

of source of information, 507, 508 
Event, as change in system status, 99 
Event timing routine, 100 
Exogenous event, outside system, 99, 

100 

Expansion law, for flow graph reduc- 
tion, 66 

Extraction, and dimensions, in simpli- 
fication, 163 

Feedback, 42-44 
first-order, 111-12, 127, 129 
fourth-order, 1 23 
with lag, 47-51 
without lag, 45-47 
ncgatne (no lag). 45 
positive (no lag), 40—17 
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Feedback ( continued ) 

second-order, 113-15, 125, 128 
third-order, 128 
Fibonacci numbers, 300 n. 

File organization, &7—00 
Fde scope, and flexibility, 447—19 
File search methods, 275-78 
Filter, 157 

First canonical form, for concerting 
designation number into symbolic 
statement, 330 

Flow diagram, 18, 19, 23, 34, 42, 92 
relationships of, 20 
Flow graph, 64-80 
construction of, 65-86 
equivalent matrix of, 68-69 
of Keynes' theory of capital move- 
ment, 63 

and lag and time delay, 67-68 
and network flow problems, 78-80 
and reduction methods, 66-87 
state-descriptive, tcc State-descrip- 
tive system 

FORTRAN. 126 n., 389. 390, 392 
French Revolution, as system break- 
down, 228 

Coal(x), and constraint refinement, 
451-33 

defined, 201-02 
multiple, conflicting. 210-11 

Goal dunging storage, 122, 125-26. 
128 

Governor, steam, as feedback device, 
43-44 

GPSS-IU, 3^0.201. 321 

Graph theory, nodes and arcs in, 10 
Grouping methods of simplification, 
157. 15S. 159, 160, 174-80 
logical and arbitrary. 175-79 
problems in, 180-90 passim 
prototypes used in. 170-80. ISO 
statistical. 174 

Cuessmg. with logical proof, 318-20 

H aid* .ire »> stems, 2, 171 and n. 
llelu of Kiencc. 231 
Heuristics. 17 and n. 


Hierarchies, control system, and sys- ~ 
tem memories, 111-30 t 

Hierarchies in system definition, &6- : 

87, 90 : 

arbitrary or artificial, 88-89 
natural, 89-90 

Higher-order simplification methods, 
157, 200-31 

and high-level changes, 215-18 
and Information Inventory Sheet, 
212-15, 216 

as long-range capability studies, 230 
Hill climbing methods, in search with 
partial information, 301-03 
Homeostasis, ll2n. 

Homeostat, 112 n. 

Hough patent, 475 

Human behavior, theories of, 413 

1BSYS Control System, 126 n. 

Identity, logical, 327 
Implementation, anal>sis for, 413-39 
anal) st’s own resources in relation 

to, 437-38 

and attention and awareness func- 
tions of sjstem, 431-32 
boundary problems in, 437 
cffcctiv encss measures in, 430 
operating standards developed be- 
fore, 432 

organization of data for, 426-39 
and problem anticipation file, 427- 
39 

problem* of, 422-26 
and scope of S) stem, 438-39 
and updating provision*. 433-34 
Implication, logical, 328-29 
Independence, creation of, in trouble 
shooting. 26$ 

Indexing, and information search pro- 
cedure, 2S0-8I 

Indifference, zones of, in simplification. 

Indifference curves, 176, 177, 178 
Industrial revolution, 23 
Infinna specie*. in logic. 86 
Information, complete inside, about 
structure and function. 252-54 



Index 539 


internal probing for, 246-51, 252 
measures of, 118-20, 506-09 
partial, range of possible tests nar- 
rowed by, 251-52 

Information handling, improved, 446- 
51 

Information Inventory Sheet, 213-15, 
216 

Information processing, creative, 450- 
51 

Information search procedure, 272-82 
with relaxed conditions, 278-80 
Information theory, 119 »i., 245 
and Ashby’s Law of Requisite Vari- 
ety, 149-50 
Input, 19 

compatibility of output to, 33 
Inside tests, 246-51, 252 
combined with outside tests, 251-52 
versus outside tests, 241-45 
Institutional process, 414, 415-16 
Instrumental process, 414-15 
Intermittent failures, trouble-shooting 
methods for, 267-68 
Inventory control, ABC systems of. 
168, 169 

Inversion, matrix, 468, 469, 470 and 
471,472 
Iso-lines, 176 

Isomorphic systems, 15, 16, 17 
Item-characteristic file, in information 
search procedure, 272-73 
Item retrieval, in information search 
procedure, 273-75 

Job shop, simulation of, 370-77, 393— 
403 

Keynes’ theory of capital movement, 
mechanical analogy of, 364 

Lag, feedback with, 47-51 
and flow graph, 67-68 
Language, computer simulation, 389— 
93 

simplified instruction in, 167 
structure of, 118, 119 


Laplace transform, 47, 478 n., 486-91, 
492 

Learning storage, 122, 125-26, 128 
“Least Effort, Principle of," 168 
Linear programing, 164-66 
Linear system, response of, to sinu- 
soidal input, 242 n. 

Link Trainer, 366 

Logarithms, used in simplification, 
191-92 

Logic, hierarchyin, 86 
and probability, in systems diagno- 
sis, 31&-52 

See also Boolean algebra 
Logical addition, 323-25 
Logical identity, 327 
Logical implication, 328-29 
Logical multiplication, 325 
Logical negation, 325-26 
Logical operations, 27, 38-41 
combining, 41 

Logistics, new, of information han- 
dling, 446-47 

Long-range planning, trend in, 451-52 
Loops, 21, 38, 40 


Machine, deterministic, 321 
probabilistic, 321 
simple, 127, 129 

Management theory, and systems anal- 
ysis, 7-10 
Map, contour, 176 

road, system elements and relation- 
ships m, 13, 14 
Markov processes, 508 
Masking matrix, 333, 334 
Mathematical transformation (s), 473- 
93 

of Cartesian xy coordinates of point 
to polar coordinates, 474 
complex. 477-91 
line-point and point-line, 475-77 
simple, 474-77 
Matrix, masking, 333. 334 
order of, 457 
transition, 463 n. 

Matrix descriptions, of system, 21-22 
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Matrix display, from coded transaction 
data, 96 

Matrix operation, 37, 457—7 2 
of addition, 457, 459 
identity I in, 468, 469 
of inversion, 468, 469, 470 and n., 
471, 472 

of multiplication, 459-64, 470, 471— 
72 

in solution of equations by detached 
coefficients tableau method, 466- 
67, 469, 470 n. 

in solution of simultaneous linear 
equations, 464, 465-66 
of subtraction, 457, 459 
Maximum capacity, of network to han- 
dle How, 78-79 
McBee Keysort System, 89 
Meat-packing operations, analysis of 
capabilities of, 224-25 
Medicine, angiograms and arterio- 
grams in, 252 

and human eye as “window” in sys- 
tem, 247 

radioactive tracers used in, 243—14 
and symptom-cause-complex table, 
339-41, 342-51 
Memory, 115-29 

media used for, 120-21 
organization of, 122-20 
reorganization of, and learning, 
126-27 

size of, 120-21, 124 
symbols used in, principles for, 
116-20 

See also Storage 
Memory box, 123, 124 
Memory capacity, and speed of pro- 
cessing, 14&-49 

Memory storage equipment, 113 
Military application, of trade and in- 
difference curves, 177-78 
Minimum project time, and network 
flow, 79 

Modular construction, 442-46 
benefits of, 443-44 
trade-offs when using, 444 — 16 
Monogram calculation, 508 


Monte Carlo Methods, 377, 378, 381, 
402, 405 

Multidimensional state diagram, 73- 
74,75 

Multiple goal conflict, 210-11 
Multiplication, logical, 325 
matrix, 459-64, 470, 471-72 
Multiplication Law of Probability, 381 
Multiplication rule, for black box com- 
bination, 35 

for flow graph reduction, 60 
Multiway branches, for logical selec- 
tion, 39-40, 41 

Negation, logical, 325-26 
Negative feedback (no lag), 45 
Network flow problems, 78-80 
Nodes and arcs, in graph theory, 16 
Noise, channel capacity reduced by, 
150 

Nonlinear distribution of activity, 168, 
169, 170, 171 
NORAD, 433 

Notation, for continuous functions, 
487 

for discrete functions, 479 
Standard Matrix, 458 
symbolic, 321-22 
Novelty storage, 122, 125-26, 128 

Operationahsm, in physics, 30, 32 
Operations research models, as simu- 
lators, 367 

Optimality, Bellman’s Principle of, 
309-10 

Optimization, 114, 159, 164, 165, 202, 
297, 303 

Order-processing system, 18, 20 
Oscillation, 46, 211 

Outcome, of combination of system 
action and analyst reaction, 137, 
139, 140. 141, 142, 143 
Outcome table, 137, 139, 142, 144, 
149 

Output, compatibility of input to, 33 
prediction of, 19, 239 
Outside tests, combined with inside 
tests, 251-52 
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danger in using, 245-46 
for input-output balance, 243, 244 
versus inside tests, 241-45 
as response tests, 241-43, 248 

Paperwork transactions, measurement 
of frequency of, 166-87 
Parsimony, Principle of, 156 
Partitioning, as simplification proce- 
dure, 194-95 

Pattern storage, routine, 122, 123 
Patterns, distinguished from data, 
122 - 2 3 

Peak load conditions, study of, 169 
Permanent entity, in SJMSCBIPT ap- 
proach, 98, 99 

Poisson distribution, 286, 287 tt. 

Police force, Chicago, reorganization 
of, 218 and n. 

Politics, clusters in, 229 
Positive feedback (no lag), 46-47 
Potential and present capability, prob- 
lem of, 218-22, 226 
Prediction, 19. 127-28, 239-40 
first-order, 111-12, 127, 129 
second-order, 113-15, 125, 128 
and stability, statistical, 180, 181 
and variability, 180 
See also Feedback 

Present and potential capability, prob- 
lem of, 218-22, 226 
Probabilistic machine, 321 
Probability, 75-78 

and logic, in systems diagnosis, 
318-52 

Multiplication Law of, 381 
Probability tables, diagnostic, 341-51 
Probe, internal, 246-51, 252 
Probe effect, 248, 250 
Probe points and probe types, selec- 
tion of, 248-51 

Processing ability, cost of, versus cost 
of simplification, 155-57 
and memory capacity, 148-49 
Profit, goal to maximize, 202 
Prototypes, in grouping methods of 
simplification, 179-80, 168 
Pseudo-random numbers, 387 


Purposeful systems, 24 

Questions, intelligence, and capability 
problems, 223-30 

Radio, amplitude modulated signals 
in, 173-74 
oscillators in, 47 

Radioactive tracers, used in medicine, 
243-44 

Random number generation, computer 
comments on, 387-89 
Random numbers, tables of, 378, 379, 
385, 386, 387 
Random sample, 166 
Redundancy, 118, 126 n., 508, 509 
Reference group, concept of, 226, 227 
Regression analysis, statistical, 170, 

171 

Reportorial element, in long-range 
definition, 221 

Requisite Variety, Ashby's Law of, 
see Ashby's Law of Requisite 
Variety 

Response test, 241-43, 246, 254 
Restricted ranges of measure or inter- 
est, in elimination, 160-62 
Road map, system elements and rela- 
tionships in, 13, 14 
Routinepattem storage, 122, 123 
Rules, for black box combination, 35- 
37 

for flow graph reduction, 66-67 
for matrix operation, see Matrix op- 
eration 

Russian Roulette game, 381-82 

SACE system, simulated tests of, 366 
Sampling, from continuous distribu- 
tions, 384-87 

from discrete distributions, 377-84 
Search problems, 259-310 

dynamic programing as approach 
to. 303-10 

and extreme values, 292-94 
with partial information. 297-303 
random, 282—94 

and unimodal optimum, 297-301 
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Search problems (continued) 

See also Information search proce- 
dure; Trouble-shooting methods 
Short and continuity checks, 261-62 
Signal substitution, 263, 272 
Signal tracing, 262, 263, 272 
Simple machine, 127, 129 
Simplex Method, of comer evaluation, 
163 arid n. 

Simplification, cost of, versus cost of 
processing ability, 155-57 
direct methods of, 157-95 
by elimination, see Elimination 
methods of simplification 
by grouping, see Grouping methods 
of simplification 

higher-order methods of, see Higher- 
order simplification methods 
logarithms used in, 191-92 
by partitioning, 191-95 
principle of, 17 

by substitution and transformation 
of variables, relationships, and 
parameters, 190-93 
symmetry as procedure in, 193 
variable-to-attnbute method of, 175 
zones of indifference in, 175-77 

sim script, 93-102, 390 and 

391,392 and n„ 393,103 
Simulation, 358-407 
advantages of, 359 
analog, 360-64 

blackboard exercise in, 393-403 
and computer languages, 339-93 
constraints on, 367-69 
electrical and mechanical (analog), 
360-ei 

and evaluation of alternate system 
designs, 400 

example of technique in, 370-77 
and “exercising the system," 403 
and heuristic selection of lest cases, 
404-05 

manual, 359-60, 369, 372, 373 
375-77 

mechanical and electrical (analog) 
360-61 

operational, 366-67, 404 n. 


and parametric variation of test 
cases, 405 

realistic, 358-67 

and selection of test variations, 
403-06 

symbolic, 365-66 
and validation of system, 403-04 
variability introduced into, 377-403 
Skill files, 449-50 

Social science, and technological 
change, 414-22 

Sociology, reference group of, 226-27 
Software systems, 125, 171 and n. 
Specification, principle of, 14, 17 
Speculative element, in long-range 
definition, 221 

Stability, statistical, 181 and n„ 182, 
183 

and consistency, 183-86 
and prediction, 180, 181 
and variability, 181, 183, 186 
Standard computer programs, as simu- 
lators, 387 

Standard Matrix Notation, 458 
State, defined, 69 

State-descriptive system, 89-80, 98, 
100 

deterministic, 70, 71, 72, 73 
matrix analysis of flow graph in, 
72-73 

probabilistic, 71, 72 
and steady-state conditions, 73 
and transitions, 70-71 
Stale diagram, multidimensional, 73- 
74, 75 

Statements, in logical manipulations, 
321-23 

Statistics, Central Limit 'theorem in, 
170 and n., 389 
correlation analysis in, 170 
data reduction in, 174 
regression analysis in, 170, 171 
Sec also Sampling; Stability, statis- 
tical 

Storage, 115, 116, 121, 447 
types of, 122-26 
See also Memory 

Stratification, and variability, 186-90 
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Stress methods, in trouble shooting, 
268-70 

Subtraction, matrix, 457, 459 
Summum genus, as system, 86 
Supervisory control program, 126 n. 
Symbolic notation, 321-22 
Symbols, used in memory, principles 
for, 116-20 

Symmetry, as simplification proce- 
dure, 193 

Symptom-cause-complex tables, 334-37 
logical diagnosis from, 337-38 
tests from, 338-41 
System (s), complexity of, 136, 138 
constraints in, 157, 158, 159, 164, 
211-12, 219, 221 

conversion operations of, see Trans- 
formation^) 

correction operations of, 27, 41-51 
dynamic response of, to stated input 
(trajectory), 48-49 
elements of, 17-18, 19, 20, 87 
equilibrium condition of, 49, 73 
as flow diagram or network, 17-18 
hardware, 2, 171 and n. 
hierarchy of, 127-29 
history of descriptions of, 1, 22-24 
improvement of, 442-53 
inputs and outputs of, 19-20 
isomorphic, 15, 16, 17 
learning in, 122, 125-27, 128 
logical operations of, 27, 38-4 1 
matrix descriptions of, 21—22 
and measurement of relationships 
and element properties, 20 
memory characteristics in, see Mem- 
ory 

as network or flow diagram, 17-18 
open-end, 127 
order-processing, 18, 20 
purposeful, 24 

reasons for testing, 237-4 1, 246 
relationships among elements of, 18, 

20 

simplification of, see Simplification 
software, 125, 171 and n. 

State~desc/ipth e, see State-descrip- 
tive system 


status of, SIMSCRIPT description 
of, 98-100 

steady-state condition of, 49, 73 
strategic, 128 
as summum g enus, 86 
tactical, 128 
teleological, 24 
tests for, see Tests for systems 
transformation operations of, see 
Transformation (s) 
transient condition of, 49 
use of classification of, 129-30 
variety of, see Ashby's Law of Req- 
uisite Variety; Variety 
Systems analysis, and complexity, 136, 

138 

for implementation, see Implemen- 
tation 

inelegant but efficient, 156 
as institution, 3-7 
and management theory, 7-10 
new, 1-3 

purposes of, 237-41 
as scientific method, I 
search problems in, see Search prob- 
lems 

and variety, see Ashby's Law of 
Requisite Variety; Variety 
Systems definition, 13, 136, 156, 181, 
239 , 240 

circular process of, 230-31 
classification in, 87, 89, 91—93, 98- 
102 

constraints in, 157, 158, 159, 164, 
211-12,219.221 
on dynamic basis, 98 
hierarchies in, see Hierarchies in 
system definition 
higher-order approach to, 201-15 
mechanizing, 91-98 , 100 
objective of, 14-17 
Systems and Procedures Association, 4 


Tabledex Method, of file organization 
and search, 276-78 

Tabular data, rearranged into parti- 
tioned blocks, 194-95 
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Technological change, and social sci- 
ence, 414-22. 

Telemetry, conversion steps in, 52 
Teleological systems, 24 
Temporary entity, in SIMSCRIPT ap- 
proach, 98, 99 

Temporary event notice list, 100, 101, 
102 

Tests for systems, on group of possi- 
bilities, 294-97 
inside, see Inside tests 
outside, see Outside tests 
reasons for, 237—41, 248 
See also Search problems 
Thermodynamics, Second Law of, 245 
Thermostat, 111—113 
Thesaurus of terms, and information 
search procedure, 281 
Threshold and discrimination methods, 
of elimination, 166-74 
Time delay, 67-68 

transformation with, 37-38 
Time-position chart, in system simula- 
tion, 372-74 

Total System, 5, 204, 207, 208, 224 
Trade curves, 177-78 
Traffic patterns, study of, 172-73 
Transaction coding structure, 92, 93 
Transaction sheet, 96 
Transfer function, black box, defined, 
35 

Transform, Laplace, 47, 478 n., 486- 
91,492 
matrix as, 470 
=. 47, 477-86, 492 

Transfonnation(s), 20, 27-38, 191- 
92 

mathematical, see Mathematical 
transformation (s) 

matnx multiplication as, 470, 471- 
72 

Transition matrix, 463 n. 

Transition tree, 75 

Transmittance, in flow graph analysis, 
64 

Tree diagram, and sequences of transi- 
tion, 74-78 
See cho Decision tree 


Trial boundaries, area of system in- 
terest limited by, 161 
Trouble-shooting methods, 260-72 
Truck routing, study of, 175 
Truth tables, 322-23 

Uncertainty principle, in physics, 24 
Unimodal optimum, search for, 297- 
301 

Univac 409 Computer, 7 

Variability, introduced into system 
simulations, 377-403 
and prediction, 180 
and statistical stability, 181, 183, 
180 

and stratification, 186-90 
Variable (s), in relationships of flow 
diagram, 20 

substitution of, in simplification, 
190, 191 
Variety, 136-67 

coded, requisite rate of, 146—17 
rate of, 145—10 

See also Ashby’s Law of Requisite 
Variety 

Vector inputs, transformation of, 37 
Vending machine, logical flow dia- 
gram of, 40, 42 
Venn diagrams, 323-325 

Wiener, Norbert, 4, 429 
World War II, morale studies during, 
227 

“Operation Mincemeat" of, 228 and 
n. 

selection of German ball-bearing in- 
dustry for destruction in, 220 

X-bar chart, used in industrial quality 
control work, 170 

z transform, 47, 477-86, 492 
Zip Code, postal, 500 
Zipfs law, 168, 169 
Zones of indifference, in simplification, 
175-77 



